IDENTIFICATION OF SELF AND NON-SELF ANTIGENS
IMPLICATED IN AUTOIMMUNE DISEASE

Field of the Invention

The present invention relates to the field of immunology and, in particular, to the 
identification of self and non-self antigens implicated in human autoimmune responses. The 
invention relates to methods of identifying such self and non-self antigens and provides examples 
of such antigens relating to multiple sclerosis and pemphigus vulgaris. The invention also relates 
to the use of such antigens for in vitro assays, animal models, therapeutic agents and vaccines.

Background of the Invention

Human autoimmune diseases have a striking genetic association with particular alleles of
major histocompatibility complex ("MHC") class I or class II genes. The field was established
by the seminal discovery of HLA-B27 linked susceptibility to ankylosing spondylitis, a chronic
inflammatory joint disease (Brewerton et al., 1973; Schlosstein et al., 1973). MHC associated
susceptibility has now been documented for a variety of human autoimmune diseases, including
insulin dependent diabetes mellitus (IDDM), rheumatoid arthritis (RA), pemphigus vulgaris
(PV), multiple sclerosis (MS) and myasthenia gravis (MG), to name a few (Todd et al.,
1987; Ahmed et al., 1990; Ahmed et al., 1991; Lanchbury & Panayi, 1991; Spielman &
Nathenson, 1982; Protti et al., 1993).

The MHC class II locus most commonly associated with autoimmune disease is the
HLA-DRβ locus (also known as DRB1), a highly polymorphic locus with over fifty known
alleles. For example, a large body of epidemiological work has documented the association of
rheumatoid arthritis with the DR4 (DRB1*0401, DRB1*0404) and DR1 (DRB1*0101) alleles,
with the DR4 alleles conferring a higher risk than DR1 (Lanchbury & Panayi, 1991). The risk is
dramatically increased when the subject is homozygous or heterozygous for DRB1*0401 and/or
DRB1*0404. The observation that arthritis is associated with three DR alleles that are
structurally similar led to the development of the 'shared epitope' hypothesis, as DRB1*0401,
0404 and 0101 share critical polymorphic residues in the DRβ 67-71 cluster (Gregersen et al.,
1987; Lanchbury & Panayi, 1991). These residues (in particular DRβ 71) appear to be critical in
defining the selectivity of peptide binding to the disease associated molecules.



WO 96/27387 PCT/US96/03182

Pemphigus vulgaris is an autoimmune disease of the skin in which high titer autoantibody
production to an epidermal cell adhesion molecule (desmoglein 3) results in a loss of
keratinocyte adhesion (acantholysis) and severe blister formation (Amagai et al., 1991). In
different ethnic groups the disease is associated either with a DR4 allele (DRB1*0402) or with a
rare DQ1 allele (DQB1*05032); only a small fraction of PV patients have neither susceptibility
gene (Ahmed et al., 1991; Ahmed et al., 1990; Scharf et al., 1988b). The DR4 subtype
associated with pemphigus differs only at three positions in the DRβ 67-71 cluster from the DR4
subtype associated with RA. The PV associated molecule has a negative charge (Glu) at the
critical position (DRβ 71); the neighboring position (DRβ 70) is also negatively charged. The
DR4 subtype associated with PV is the only one that carries a negative charge at DRβ 71; a
positive charge (Arg) is found at DRβ 71 in the RA associated DR4 molecules.

With respect to multiple sclerosis, recent immunological studies suggest that myelin basic
protein (MBP) may be one of the important target antigens in the immunopathogenesis of the
disease. Several studies have demonstrated that MBP specific T cells are clonally expanded in
MS patients and in an in vivo activated state (Allegretta et al., 1990; Wucherpfennig et al.,
1994b; Zhang et al., 1994). Reactivity with the immunodominant MBP(84-102) peptide is found
predominantly in subjects carrying HLA-DR2 (the most common subtype of which is
DRB1*1501), a genetic marker for susceptibility to MS. The MBP(84-102) epitope can also be
presented by other MHC class II antigens, including HLA-DQ1 (Ota et al., 1990; Martin et al.,
1990; Pette et al., 1990; Wucherpfennig et al., 1994a). In vivo, the T cell response to this peptide
appears to be dominated by a few expanded clones.

While associations between MHC alleles and disease states have implicated autoimmunity
in the aetiology of these diseases, a large body of clinical and epidemiological evidence suggests
that infections may be important in the induction of autoimmunity. For example, particular viral
infections frequently precede autoimmune myocarditis and type I diabetes (IDDM) (Rose et al.,
1986; Ray et al., 1980). Environmental agents also influence the risk of developing multiple
sclerosis, as demonstrated by migration studies. Individuals who migrate after age 15 carry the
risk for developing MS associated with their geographic origin, while individuals who migrate
earlier in life acquire the risk of the geographical region to which they migrated (Kurtzke, 1985).
These studies are consistent with the hypothesis that a group of pathogens that are relatively
ubiquitous in a certain geographic region influences the risk of developing multiple sclerosis
(MS). The mechanism(s) leading to clonal expansion of MBP reactive T cells remain to be
identified but could involve recognition of viral peptides with sufficient structural similarity to
the immunodominant MBP peptide. The initiation of autoimmunity by such a mechanism could
then lead to sensitization to other CNS self antigens by determinant spreading (Lehmann et al.,
1992; Kaufman et al., 1993; Tisch et al., 1993). Consonant with this hypothesis, it has been noted
that inflammatory CNS disease can follow infection with a number of common viral pathogens,
such as measles and rubella. On the other hand, the absence of virus in the CNS of these patients
and reactivity to myelin basic protein in these patients suggest an autoimmune mechanism
(Johnson et al., 1984).

Efforts to identify sequence homologies between self peptide epitopes that might be
involved in autoimmunity and various bacterial and viral pathogens have therefore been made.
These homology searches have focused on alignments with sequence identity. No success has
been reported using such alignments in identifying epitopes from pathogens that could cross
react with presumably pathogenic T cell lines from human patients with autoimmune disease
(Oldstone, 1990). A sequence identity was recently found between an epitope in a Coxsackie
virus protein and GAD65, suspected of being an autoantigen in diabetes. These peptides could
reciprocally generate polyclonal T cell lines from mice that cross react with the other peptides
(Tian et al., 1994). No evidence, however, was provided that these peptides could stimulate
clones from diabetic mice (or humans).

Recent developments, in particular the identification of allele specific peptide
binding motifs, have transformed the field (Madden et al., 1991; Rotschke & Falk, 1991). Based
on this knowledge, the structural basis for MHC linked susceptibility to autoimmune diseases can
be reassessed at a level of detail sufficient for solving longstanding questions in the field. Motifs
for peptide binding to several MHC class I and class II molecules have been defined by sequence
analysis of naturally processed peptides and by mutational analysis of known epitopes. MHC
class I bound peptides were found to be short (generally 8-10 amino acids long) and to possess
two dominant MHC anchor residues; MHC class II bound peptides were found to be longer and
more heterogeneous in size (Madden et al., 1991; Rotschke & Falk, 1991; Jardetzky et al., 1991;
Chicz et al., 1992, 1993). Due to the size heterogeneity, however, it has proven more difficult to
define MHC class II binding motifs based on sequence alignments. More recently, a crystal
structure for HLA-DR1 demonstrated that there is a dominant hydrophobic anchor residue close
to the N-terminus of the peptide and that secondary anchor residues are found at several other
peptide positions (Brown et al., 1993). Even this work, however, could not provide a detailed
description of the binding pockets of HLA-DR proteins, the particular residues involved in the
formation of these pockets or the structural requirements of antigens for MHC binding.

In the present disclosure, a detailed description of the HLA-DR antigen binding pockets is
provided (Stern et al., 1994). With this information, together with functional information
defining those amino acids of the self or non-self antigen that are needed for MHC binding and
TCR contact (e.g., Wucherpfennig et al., 1994, 1995), binding motifs for the various HLA-DR
allotypes may be developed, self epitopes involved in autoimmune disease may be identified and
a method is provided for identifying bacterial and viral epitopes which may initiate a human
autoimmune response.

Summary of the Invention 

The present invention provides, in one aspect, seven different isolated polypeptides
derived from the human desmoglein 3 protein and implicated as self epitopes in the autoimmune
disease pemphigus vulgaris (PV). These peptides consist essentially of the seven amino acid
sequences disclosed herein and designated SEQ ID NO.: 1 through SEQ ID NO.: 7. In
particular, the invention provides isolated peptides which consist of these sequences, the core
MHC binding residues of these sequences, or the inner core MHC binding residues of these
sequences.

The present invention provides, in another aspect, eight different isolated polypeptides
derived from human pathogens but implicated in the aetiology of the autoimmune disease
multiple sclerosis. These peptides consist essentially of the eight amino acid sequences disclosed
herein and designated SEQ ID NO.: 8 through SEQ ID NO.: 15. In particular, the invention
provides isolated peptides which consist of these sequences, the core MHC binding residues of
these sequences, or the inner core MHC binding residues of these sequences.

In another set of embodiments, the invention provides for pharmaceutical preparations for
use in tolerizing individuals to autoantigens. The preparations include a pharmaceutically
acceptable carrier and an isolated human polypeptide which includes an amino acid sequence
corresponding to a sequence motif for an MHC class II protein which is associated with a human
autoimmune disease. These polypeptides are capable of binding to the protein to form a complex
which activates autoreactive T cells in subjects having the autoimmune disease. The peptides are
not derived from human collagen or human myelin basic protein. The protein preferably is an
HLA-DR protein.



In particular embodiments, such pharmaceutical preparations are provided in which the 
HLA-DR protein is HLA-DR4 or HLA-DQ1 protein and the autoimmune disease is pemphigus 
vulgaris. In addition, a particular sequence motif is provided for pemphigus vulgaris and 
pharmaceuticals having peptides with this motif are provided. Specific embodiments of the 
pharmaceuticals include each of the polypeptides described above with respect to pemphigus
vulgaris. Thus, methods of tolerizing an individual to a pemphigus vulgaris autoantigen are also 
provided. 

In another set of embodiments, the invention provides for pharmaceutical preparations for
use in tolerizing individuals to antigens of human pathogens which are implicated in human
autoimmune disease. The preparations include a pharmaceutically acceptable carrier and an
isolated human pathogen polypeptide which includes an amino acid sequence corresponding to a
sequence motif for an MHC class II protein, such as an HLA-DR protein, which is associated with
a human autoimmune disease. These polypeptides are capable of binding to the protein to form a
complex which activates autoreactive T cells in subjects having the autoimmune disease.

In particular embodiments, such pharmaceutical preparations are provided in which the
protein is HLA-DR2 protein and the autoimmune disease is multiple sclerosis. In addition, three
particular sequence motifs are provided for multiple sclerosis and pharmaceuticals having
peptides with at least one of these motifs are provided. Specific embodiments of the
pharmaceuticals include each of the polypeptides described above with respect to multiple
sclerosis. Thus, methods of tolerizing an individual to a multiple sclerosis foreign antigen are
also provided.

In another aspect of the invention, pharmaceuticals are provided for vaccination against a
human pathogen implicated in the aetiology of autoimmune disease. These pharmaceutical
preparations include a pharmaceutically acceptable carrier and an immunogenic preparation
effective to immunize against a human pathogen. The human pathogen is one which in its native
form includes a polypeptide having an amino acid sequence corresponding to a sequence motif
for an MHC class II protein, such as an HLA-DR protein, which is associated with the
autoimmune disease. These polypeptides are capable of binding to the protein to form a complex
which activates T cells which become autoreactive and initiate the autoimmune disease. The
preparations of the present invention specifically do not include such polypeptides but, rather,
include other antigens from the pathogen.

In particular embodiments, such pharmaceutical preparations are provided in which the
protein is HLA-DR4 protein and the autoimmune disease is pemphigus vulgaris. In addition, a
particular sequence motif is provided for pemphigus vulgaris and pharmaceuticals which lack
peptides having this motif are provided. Specific embodiments of the pharmaceuticals include
preparations lacking each of the polypeptides described above with respect to pemphigus
vulgaris. Thus, methods of immunizing an individual against pathogens which may cause
pemphigus vulgaris are also provided.

Similarly, pharmaceutical preparations are provided in which the protein is HLA-DR2
protein and the autoimmune disease is multiple sclerosis. Three particular sequence motifs are
provided for multiple sclerosis and pharmaceuticals which lack peptides having any of these
motifs are provided. Specific embodiments of the pharmaceuticals include preparations lacking
each of the polypeptides described above with respect to multiple sclerosis. Thus, methods of
immunizing an individual against pathogens which may cause multiple sclerosis are also
provided.

The pharmaceutical preparations for immunizing against pathogens that may cause
multiple sclerosis may particularly include inactivated forms of the pathogen in which the
polypeptide corresponding to the motif is removed or recombinantly altered. Thus, the
invention provides such vaccines in which the pathogen and polypeptide are, respectively,
Herpes simplex virus and UL15 protein, Herpes simplex virus and SEQ ID NO.: 8, Adenovirus
and Adenovirus ORF protein, Adenovirus and SEQ ID NO.: 9, Pseudomonas aeruginosa and
phosphomannomutase protein, Pseudomonas aeruginosa and SEQ ID NO.: 10, Papillomavirus
and L2 protein, Papillomavirus and SEQ ID NO.: 11, Epstein-Barr virus and DNA polymerase
protein, Epstein-Barr virus and SEQ ID NO.: 12, Influenza virus and hemagglutinin protein,
Influenza virus and SEQ ID NO.: 13, Reovirus and sigma 2 protein, Reovirus and SEQ ID NO.:
14, Herpes simplex virus and DNA polymerase, and Herpes simplex virus and SEQ ID NO.: 15.

The present invention also provides general methods for evaluating a peptide for an ability
to induce an autoimmune response. These methods involve choosing an MHC class II molecule
associated with the autoimmune response, selecting at least two major MHC binding pockets of
the molecule, identifying sets of amino acid residues which bind within each of the selected
pockets, developing a sequence motif for the molecule in which the sets of amino acids define
the allowed amino acids at the corresponding positions of the motif, and then comparing the
amino acid sequence of the peptide to the sequence motif. Peptides which match the motif have
a much greater likelihood of inducing the autoimmune disease. In addition, if there is a known
epitope implicated in the disease, the method may further include selecting at least one TCR
contact residue of the epitope, identifying a set of amino acid residues which may serve as the
TCR contact, and including this set in the motif at the appropriate position. In preferred
embodiments, the molecule is an HLA-DR protein and the motifs include restrictions on the
residues at positions corresponding to at least the P1 MHC binding pocket and at least one of the
P4 and P6 pockets.
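
By way of illustration only, the steps recited above (selecting pockets, assigning a set of residues to each, assembling a motif, and comparing a peptide to it) might be sketched as follows. This is a minimal sketch under stated assumptions: the function names `build_motif` and `peptide_matches`, the dictionary layout, and the residue sets chosen for the two pockets are hypothetical and form no part of the disclosed method.

```python
# Illustrative sketch only; names and data layout are assumptions.

def build_motif(allowed_by_position):
    """Assemble a motif spanning P1..Pmax from per-pocket residue sets.

    allowed_by_position maps a pocket position (1, 4, 6, ...) to the set of
    one-letter residue codes allowed there. Positions not listed are left
    unrestricted, which is represented by None.
    """
    last = max(allowed_by_position)
    return [allowed_by_position.get(p) for p in range(1, last + 1)]


def peptide_matches(peptide, motif):
    """Return True if any window of the peptide satisfies the motif."""
    width = len(motif)
    for start in range(len(peptide) - width + 1):
        window = peptide[start:start + width]
        if all(allowed is None or residue in allowed
               for residue, allowed in zip(window, motif)):
            return True
    return False


# Hypothetical residue sets for two selected pockets (P1 and P4).
motif = build_motif({1: set("VLIAMF"), 4: set("VLIAMFYW")})
```

A TCR contact restriction, where a known epitope exists, could be added in the same way by supplying a residue set for the relevant position (for example P2, P3 or P5) in the mapping.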

In another embodiment of the invention, methods are provided specifically for identifying
foreign antigens implicated in human autoimmune response. These methods include the same
steps as the previously described methods, but further include a comparison of the resulting
sequence motif to peptide sequences from sets of human pathogens. In preferred embodiments,
peptide sequences from one or more species in the normal human intestinal flora are excluded
from consideration. In another preferred embodiment, sequences from one or more species of
pathogen which is negatively correlated with the incidence of the disease are excluded. In a most
preferred embodiment, the human pathogen peptides are searched and evaluated on a computer
database using the motif as a search criterion.
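
A database search of this kind, with exclusion of selected species, might be sketched as shown below. The sketch is illustrative only: the record layout, the function name `find_motif_hits`, the species labels, the toy sequences and the toy motif are all hypothetical placeholders and are not sequences or motifs of the invention.

```python
# Illustrative sketch only; all data below are made-up placeholders.

def find_motif_hits(records, motif, excluded_species=frozenset()):
    """Scan each sequence with a sliding window and collect motif matches.

    records: iterable of (species, protein_name, sequence) tuples.
    motif: list of sets of allowed one-letter codes; None = unrestricted.
    excluded_species: species skipped entirely (e.g., normal intestinal
    flora, or pathogens negatively correlated with the disease).
    """
    hits = []
    width = len(motif)
    for species, protein, sequence in records:
        if species in excluded_species:
            continue
        for start in range(len(sequence) - width + 1):
            window = sequence[start:start + width]
            if all(allowed is None or residue in allowed
                   for residue, allowed in zip(window, motif)):
                hits.append((species, protein, start, window))
    return hits


toy_motif = [set("VLIMF"), set("H"), None, set("VLIMFYW"), set("K")]
toy_records = [
    ("species A", "protein 1", "GGLHAWKGG"),
    ("species B", "protein 2", "AALHAWKAA"),  # excluded below
    ("species C", "protein 3", "GGGGGGGGG"),
]
hits = find_motif_hits(toy_records, toy_motif, excluded_species={"species B"})
```

Excluding species before scanning, rather than filtering hits afterwards, keeps the search limited to the restricted database described above.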

Brief Description of the Drawings

Figure 1 illustrates the pockets in the HLA-DR1 peptide binding site. The upper right shows a
top view of the molecular surface (1.5 Angstrom probe radius) of the HLA-DR1 peptide-binding
site with the Cα trace of the HLA-DR1 shown as thin lines. Pockets that accommodate the peptide
side chains of an antigen (in this case Influenza virus hemagglutinin) are shown in detail in
surrounding views, and are numbered as described herein. The P1 pocket accommodates the Tyr
(308) of the antigen. The P4, P6, P7 and P9 pockets bind antigen residues Gln (311), Thr (313),
Leu (314) and Leu (316) respectively. Antigen peptide side chains and the nearby peptide main
chain are represented by CPK models and HLA-DR side chains in contact with the peptide are
indicated as stick models. Pockets are viewed in the plane of the peptide binding site, towards
the N terminus of the peptide (P1 and P6), towards the C terminus of the peptide (P4), towards
the β1 helical region (P7) or towards the α1 helical region (P9). A full color version of this figure
may be found in Stern et al., 1994.

Detailed Description of the Invention

The present invention relates to a method of identifying and evaluating peptides for their
ability to induce an autoimmune response or to cause autoimmune disease. In particular the
invention relates to methods of (1) evaluating self peptides for their potential involvement in
autoimmune disease when the self epitope or autoantigen is unknown and (2) evaluating foreign
peptides for their possible involvement in the aetiology of autoimmune disease. The invention
also relates to specific peptides identified by the methods of the invention and representing self
and foreign antigens implicated in, respectively, pemphigus vulgaris and multiple sclerosis.

The methods rely upon the development of amino acid sequence motifs to which potential
self or foreign epitopes may be compared. Each motif describes a finite set of amino acid
sequences in which the residues at each (relative) position may be (a) restricted to a single
residue, (b) allowed to vary amongst a restricted set of residues, or (c) allowed to vary amongst
all possible residues. For consistency in this disclosure, but without limiting the invention in any
way, these sequence motifs will be symbolized as strings of characters in which (a) a position
which is restricted to a single residue will be represented by the one-letter abbreviation for that
residue, (b) a position which is allowed to vary amongst a set of residues will be represented by a
column of the one-letter abbreviations for those residues, and (c) a position which is allowed to
vary amongst all amino acid residues will be represented by an "X." As an example only, a motif
might specify that the residue at a first position may be any one of the residues valine, leucine,
isoleucine, methionine, or phenylalanine; that the residue at the second position must be
histidine; that the residue at the third position may be any amino acid residue; that the residue at
the fourth position may be any one of the residues valine, leucine, isoleucine, methionine,
phenylalanine, tyrosine or tryptophan; and that the residue at the fifth position must be lysine.
Such a motif would be represented by the following character string:

V  H  X  V  K
L        L
I        I
M        M
F        F
         Y
         W
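
For use as a computer search criterion, the same example motif could be written as a regular expression. The following is a minimal sketch, assuming each column of the character string is supplied as a string of one-letter codes and that "X" stands for any of the twenty standard amino acids; the names `ANY_RESIDUE`, `EXAMPLE_COLUMNS` and `motif_to_regex` are illustrative and do not appear elsewhere in this disclosure.

```python
import re

# Illustrative sketch only; names are assumptions.
ANY_RESIDUE = "ACDEFGHIKLMNPQRSTVWY"  # the twenty standard amino acids

# One string per motif position, read down each column of the example.
EXAMPLE_COLUMNS = ["VLIMF", "H", "X", "VLIMFYW", "K"]


def motif_to_regex(columns):
    """Translate motif columns into a compiled regular expression."""
    parts = []
    for column in columns:
        allowed = ANY_RESIDUE if column == "X" else column
        # A single allowed residue is written literally; otherwise the
        # allowed residues become a character class.
        parts.append(allowed if len(allowed) == 1 else "[" + allowed + "]")
    return re.compile("".join(parts))


pattern = motif_to_regex(EXAMPLE_COLUMNS)
```

With this pattern, `pattern.fullmatch("LHAWK")` succeeds, whereas a peptide with tyrosine at the first position or arginine at the fifth does not match.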

In one aspect of the present invention, sequence motifs are developed by analysis of the
binding domains or binding pockets of major histocompatibility complex HLA-DR proteins
and/or the T cell receptor ("TCR") contact points of epitopes bound to MHC class II molecules.
By providing a detailed structural analysis of the residues involved in forming the MHC class
II binding pockets, one is enabled to make predictions of sequence motifs for binding to any of
the MHC class II proteins.

In another aspect of the present invention, sequence motifs developed by the methods
disclosed herein may be used to identify self peptide epitopes involved in an autoimmune
response when the autoantigen is known or suspected.

In another aspect of the present invention, methods of identifying foreign peptide epitopes
implicated in autoimmune disease are provided. These methods involve the use of MHC and/or
TCR binding motifs to identify peptides derived from certain classes of organisms or pathogens
which may initiate human autoimmune response. In this aspect the motifs may be developed
according to the methods of the present invention or by other means known in the art.

Using these sequence motifs as search, evaluation, or design criteria, one is enabled to
identify classes of peptides which have a reasonable likelihood of binding to a particular MHC
molecule and of interacting with a T cell receptor to induce T cell and/or autoimmune response.
Use of these motifs, as opposed to pure sequence homology (which excludes many peptides
which are antigenically similar but quite distinct in sequence) or sequence homology with
unlimited "conservative" substitutions (which admits many peptides which differ at critical
highly conserved sites), represents a significant advance in the ability of one of ordinary skill in
the art to evaluate particular peptides for potential involvement in autoimmune disease and to
search computer databases of peptide sequences to identify self and foreign peptides which may
be involved in autoimmune response. In addition, the use of MHC class II and/or TCR binding
motifs to search limited databases for foreign peptides which may be implicated in the aetiology
of autoimmune disease is a novel application of the concept of binding motifs.

Detailed examples of the practice of the present invention are presented below. The
methods of the invention have now been used to identify the previously unknown self peptide
epitopes involved in the autoimmune disease pemphigus vulgaris. In addition, the methods have
been used to identify a class of foreign peptides which may be involved in the initiation of
autoimmune response in multiple sclerosis (for which the self epitope had previously been
identified).

Thus, in another embodiment, the invention provides these peptides, in isolated form,
which may be used in various diagnostic and therapeutic methods and products alluded to below.

I. The MHC Class II HLA-DR Molecular Mimicry Motif

The HLA-DR binding site is characterized by five major pockets which may bind the
amino acid side chains of antigens (Stern et al., 1994, the entire disclosure of which is
incorporated herein by reference). See Figure 1. The amino acid residue of the antigen which
binds in the first major pocket is designated P1. The remaining residues may then be numbered
by their positions relative to P1 (with positive numbers increasing toward the carboxy terminus
and negative numbers increasing toward the amino terminus):

P-i  P-1  P1  P2  P3  P4 ... Pj

Thus, the first major pocket of an HLA-DR molecule, by definition, binds the side chain of
residue P1 on an antigen. The remaining major pockets bind residues P4, P6, P7 and P9. These
residues are defined as the major MHC contact residues.

The amino acid side chains of residues P-1, P2, P3, P5, P8 and P11 are oriented away
from the HLA-DR binding site and, therefore, are available as contact residues for a T cell
receptor (TCR). All of these residues are defined as TCR contact residues.
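
The numbering convention and the two classes of contact residues can be illustrated with a short sketch. The names below are hypothetical. The peptide string is the commonly cited influenza hemagglutinin 306-318 sequence and is supplied here as an assumption; only its five pocket residues (Tyr 308, Gln 311, Thr 313, Leu 314 and Leu 316) are recited in the description of Figure 1.

```python
# Illustrative sketch only; names and the example peptide are assumptions.
MHC_CONTACT_POSITIONS = (1, 4, 6, 7, 9)
TCR_CONTACT_POSITIONS = (-1, 2, 3, 5, 8, 11)


def relative_positions(peptide, p1_index):
    """Label each residue by its position relative to P1.

    p1_index is the zero-based index of the P1 residue in the peptide.
    There is no position zero: the residue preceding P1 is P-1.
    """
    labelled = {}
    for i, residue in enumerate(peptide):
        offset = i - p1_index
        position = offset + 1 if offset >= 0 else offset
        labelled[position] = residue
    return labelled


# Assumed hemagglutinin 306-318 peptide; Tyr 308 (index 2) taken as P1.
ha = relative_positions("PKYVKQNTLKLAT", p1_index=2)
mhc_contacts = [ha[p] for p in MHC_CONTACT_POSITIONS]
tcr_contacts = [ha[p] for p in TCR_CONTACT_POSITIONS]
```

Under these assumptions the MHC contact residues come out as Y, Q, T, L and L, in agreement with the residues listed for the P1, P4, P6, P7 and P9 pockets in Figure 1.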

A. The MHC Contact Residues

The first major pocket of the HLA-DR molecule is strongly hydrophobic. It is formed by a
stretch of residues at about positions 85, 86, 89 and 90 of the β chain, a stretch of residues at
about positions 31, 32 and 34 of the α chain, and side chains from residues at about positions 7
and 43 of the α chain. For example, in HLA-DR1 (DRA, DRB1*0101), the first pocket is
formed by residues β85 (Val), β86 (Gly), β89 (Phe), β90 (Thr), α31 (Ile), α32 (Phe), α34 (Phe),
α7 (Ile), and α43 (Trp). The corresponding residues for other HLA-DR alleles are known in the
art (see, e.g., Marsh and Bodmer, 1992, incorporated by reference herein) and are available
through genetic databases.

Although most of the residues that shape the P1 pocket are from the highly conserved DRα
chain, the size and nature of this pocket vary due to polymorphisms in the β chain residues
involved in the pocket. For the DRB1*0101 protein, the pocket is large and hydrophobic and
can accommodate any of the aliphatic or aromatic residues. Polymorphism at the β residues,
however, may alter the binding capacity of the P1 pocket. For example, the β86 residue is
known to be polymorphic. Most commonly, this site is occupied by either Gly or Val.
Generally, when Gly is present at β86 (as in DRB1*0101), any of the aliphatic or aromatic
residues may bind within the pocket. When Val is present, however, the pocket is smaller and
Tyr and Trp cannot be accommodated. Thus, when β86 is Gly, position P1 of the molecular
mimicry motif may consist of residues chosen from V, L, I, A, M, F, Y, W and when β86 is Val,
position P1 of the motif may consist of residues chosen from V, L, I, A, M, F. Similar
considerations apply to the other β residues of the P1 pocket.
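
The β86 rule just described lends itself to a simple lookup. The sketch below is illustrative only; the function name `p1_allowed_residues` is hypothetical, and alleles carrying a residue other than Gly or Val at β86 are deliberately left unhandled because no rule for them is stated above.

```python
# Illustrative sketch only; the function name is an assumption.

def p1_allowed_residues(beta86):
    """Return the P1 residue set implied by the residue at position β86.

    beta86 is a one-letter code: "G" (Gly) gives the larger pocket,
    "V" (Val) gives the smaller pocket that excludes Tyr and Trp.
    """
    if beta86 == "G":
        return set("VLIAMFYW")
    if beta86 == "V":
        return set("VLIAMF")
    raise ValueError("no P1 rule is stated for beta86 = %r" % beta86)
```

For DRB1*0101, which carries Gly at β86, `p1_allowed_residues("G")` returns the full set of eight aliphatic and aromatic residues.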

The P4 pocket of HLA-DR molecules is also a relatively large, shallow, hydrophobic
pocket oriented across the antigen binding site. This pocket can bind a variety of large aliphatic
side chains which can maintain hydrophobic interactions along the side and floor of the pocket.
The pocket is formed by a stretch of residues at about positions 70, 71, 74 and 78 of the β chain,
and side chains from residues at about position 13 of the β chain and about position 9 of the α
chain. For example, in HLA-DR1 (DRA, DRB1*0101), the P4 pocket is formed by residues β70
(Gln), β71 (Arg), β74 (Ala), β78 (Tyr), β13 (Phe), and α9 (Gln). The corresponding residues for
other HLA-DR alleles are known in the art (see, e.g., Marsh and Bodmer, 1992) and are available
through genetic databases.

Like the P1 pocket, the P4 pocket is largely hydrophobic but its binding capacity is
affected by polymorphisms at the β residues involved in the pocket. For example, different DR
alleles have differently charged residues at position β71: in DRB1*0404, β71 is occupied by a
positively charged Arg residue whereas in DRB1*0402 β71 is a negatively charged Glu residue.
Thus, although this pocket can generally bind a variety of aliphatic or aromatic side chains (e.g.,
V, L, I, A, M, F, Y, W), positively charged P4 antigen residues are disfavored when β71 is
positively charged and, similarly, negatively charged P4 residues are disfavored when β71 is also
negative. Similar considerations apply to the other β residues of the P4 pocket. Note that some
residues may be involved in forming each of two adjacent pockets (e.g., β13 in the P4 and P6
pockets) and therefore the occupancy of one of these pockets by a particular amino acid may
influence the occupancy of the other.
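
The charge consideration at β71 can be expressed as a filter on candidate P4 residues. This is a minimal sketch, assuming that Lys and Arg are treated as the positively charged residues and Asp and Glu as the negatively charged residues; that grouping and the function names are assumptions made for illustration.

```python
# Illustrative sketch only; the charge groupings are assumptions.
POSITIVE = set("KR")  # Lys, Arg
NEGATIVE = set("DE")  # Asp, Glu


def p4_disfavored(beta71):
    """Return the P4 residues disfavored by the charge at position β71."""
    if beta71 in POSITIVE:
        return set(POSITIVE)
    if beta71 in NEGATIVE:
        return set(NEGATIVE)
    return set()


def p4_candidates(beta71, candidates):
    """Remove like-charged residues from a candidate P4 residue set."""
    return set(candidates) - p4_disfavored(beta71)
```

For example, with Arg at β71 (as in DRB1*0404) a candidate set containing Lys loses that residue, whereas with Glu at β71 (as in DRB1*0402) the acidic candidates are removed instead.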

The P6 pocket of HLA-DR molecules is a relatively shallow pocket with a preference for
smaller (e.g., A, G) P6 antigen residues. The pocket is formed by the highly conserved α11,
α62, α65 and α66 residues and the highly polymorphic β11 and β13 residues of the HLA-DR
protein. For example, in HLA-DR1 (DRA, DRB1*0101), the P6 pocket is formed by residues
α11 (Glu), α62 (Asn), α65 (Val), α66 (Asp), β11 (Leu) and β13 (Phe). The corresponding residues
for other HLA-DR alleles are known in the art (see, e.g., Marsh and Bodmer, 1992) and are
available through genetic databases.

Although there are only two β chain residues in the P6 pocket, they vary widely amongst
the DR alleles. With a large Phe residue at β13 (as in DRB1*0101), the P6 residue is preferably
one of the small residues (e.g., A, G). In other DR alleles, however, β13 is occupied by smaller
or more polar residues such as the β13 (His) of DRB1*0401. For such alleles, the P6 motif may
include somewhat larger and polar residues (e.g., S, T, V) but should still avoid the largest and
aromatic residues. Finally, in some alleles, β11 and β13 are both serine residues (e.g.,
DRB1*1101) and for these cases more hydrophilic or hydrogen bonding residues may be
included in the motif.

The P7 pocket of HLA-DR molecules is also a relatively shallow pocket. The pocket is
formed by five residues of the β chain: β28, β47, β61, β67 and β71. For example, in HLA-DR1
(DRA, DRB1*0101), the P7 pocket is formed by residues β28 (Glu), β47 (Tyr), β61 (Trp), β67
(Leu) and β71 (Arg). The corresponding residues for other HLA-DR alleles are known in the art
(see, e.g., Marsh and Bodmer, 1992) and are available through genetic databases. This pocket
does not appear to contribute greatly to the specificity of HLA-DR1 but may be important in
other alleles.

The P9 pocket of HLA-DR molecules is generally a small hydrophobic pocket and,
therefore, small hydrophobic residues are preferred at the P9 position of the antigen. This pocket
is formed by the conserved α chain residues α69, α72, α73 and α76 and by the polymorphic β
chain residues β9 and β57. For example, in HLA-DR1 (DRA, DRB1*0101) the P9 pocket is
formed by α69 (Asn), α72 (Ile), α73 (Met), α76 (Arg), β9 (Trp) and β57 (Asp). The
corresponding residues for other HLA-DR alleles are known in the art (see, e.g., Marsh and
Bodmer, 1992) and are available through genetic databases.

The P6, P7 and P9 pockets appear to be less important than the P1 and P4 pockets in
binding to DR molecules but they may be more important in binding to other isotypes (e.g., the
P9 pocket of DQ may be important).

B. The TCR Contact Residues

When there is no known or suspected antigen involved in an autoimmune response, the 
positions of the sequence motif corresponding to the TCR contact residues may be left 
unrestricted. That is, absent a known or suspected antigen, the TCR contact positions of the 
motif are preferably allowed to vary amongst all of the amino acids.

When, on the other hand, there is a known or suspected antigen involved in an
autoimmune response, at least some of the motif positions corresponding to the TCR contact
residues may be restricted according to the sequence of the antigen. Thus, for example, the P2
and/or P3 and/or P5 positions of the motif may be restricted to only those residues found at the
corresponding positions of the antigen. Alternatively, at least some of the TCR contact positions
of the motif need not be restricted to the exact residues of the antigen but may be
allowed to vary amongst similarly charged and/or structurally similar residues (e.g., K and R). It
should be noted, however, that greater conservatism with respect to the TCR contact residues of
the motif is justified by the presumably greater specificity of TCR binding relative to the known
promiscuity of MHC binding.

C. Developing an HLA-DR Sequence Motif

Given the present disclosure of the HLA-DR residues involved in the formation of the P1,
P4, P6, P7 and P9 MHC binding pockets, and given the nucleotide or corresponding amino acid
sequence of any particular HLA-DR allele, one is now enabled to develop a sequence motif
useful in evaluating or predicting the ability of peptides to bind to that MHC protein. When a
particular antigen is known to bind (or is suspected of binding) to the MHC protein, the TCR
contact residues of that antigen may also be considered in the motif.

The method first requires the selection of two or more of the MHC binding pockets for
which the choice of peptide residues will be restricted at the corresponding positions of the
motif. One may select all five of the major binding pockets and develop a motif in which the
corresponding five positions of the motif are restricted or one may select fewer and develop a
less restricted motif. As will be obvious to one of ordinary skill in the art, a more restricted
motif will identify a lesser number of peptides in a database search and a less restricted motif
will identify a greater number of peptides. In all instances, at least two of the major binding
pockets should be selected. When fewer than all five MHC binding pockets are selected, it is
preferred that at least one is P1 and that a second is chosen from P4, P6 and P9.
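
The selection rule stated above can be expressed as a short check. The following Python fragment is only an illustrative sketch and forms no part of the disclosed method; the names MHC_POCKETS and check_pocket_selection are hypothetical and were chosen for this illustration.

```python
# Hypothetical helper illustrating the pocket-selection rule described above.
MHC_POCKETS = ("P1", "P4", "P6", "P7", "P9")


def check_pocket_selection(selected):
    """Return (valid, preferred) for a choice of MHC pockets to restrict.

    valid:     at least two of the five major pockets are selected.
    preferred: all five are selected, or P1 plus at least one of P4, P6, P9.
    """
    chosen = set(selected)
    unknown = chosen - set(MHC_POCKETS)
    if unknown:
        raise ValueError(f"not a major MHC binding pocket: {sorted(unknown)}")
    valid = len(chosen) >= 2
    preferred = valid and (
        len(chosen) == len(MHC_POCKETS)
        or ("P1" in chosen and bool(chosen & {"P4", "P6", "P9"}))
    )
    return valid, preferred
```

Under this sketch, a selection of P1 and P4 is both valid and preferred, a selection of P6 and P7 is valid but not preferred, and a selection of P1 alone is not valid.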

Either before or after the pockets to be restricted by the motif are selected, the set of amino
acid side chains likely to bind within each of those pockets and, therefore, the set of amino acid
residues that will define the corresponding positions of the motif, must be determined. This may
be accomplished by one of ordinary skill in the art by considering the amino acid residues which
form the pocket. These residues, identified in Section A above, will determine the size and
nature (i.e., hydrophobic, hydrophilic, positively charged, negatively charged, uncharged) of the
pocket and, consequently, the side chains which may bind within the pocket. Reference may be
had to Figure 1 during these considerations but will become increasingly unnecessary as one
develops familiarity with the variations of the pockets.

As a general matter, in light of the identification of the residues forming the MHC binding
pockets of the HLA-DR proteins disclosed herein, one of ordinary skill in the art can easily
develop a sequence binding motif for any HLA-DR protein for which these residues are known
for two or more binding pockets. The major considerations are size, hydrophobicity and charge.
In light of the present disclosure, each of these considerations may be addressed according to
well-known principles. A baseline is disclosed herein for each pocket for the DRB1*0101 allele.
Relative to this HLA-DR protein, one of ordinary skill is enabled to develop motifs for other
HLA-DR alleles. Thus, substitutions which lead to larger/smaller pockets suggest that the
corresponding motif positions should be restricted so as to permit larger/smaller residues.
Similarly, more/less hydrophobic pockets suggest that the corresponding motif positions should
be restricted to more/less hydrophobic residues. Finally, positively/negatively charged pockets
suggest that positively/negatively charged residues should be excluded and negatively/positively
charged residues may be included at the corresponding motif positions. As noted above, the
present disclosure enables one of ordinary skill to develop motifs based upon well-established
principles.

For example, and not by way of limitation, consider the P1 pocket of the HLA-DR
protein. The residues forming this pocket in the DRB1*0101 protein were described above. For
DRB1*0101, the P1 pocket is large and hydrophobic and can accommodate any of the aliphatic
or aromatic residues (e.g., V, L, I, A, M, F, Y, W). For the DRB1*1602 protein the same is true.
On the other hand, in the DRB1*1501 protein, the β86 position is occupied by Val instead of the
Gly found in DRB1*0101 and DRB1*1602. This substitution decreases the size of the P1 pocket
in this MHC protein and, as a result, the pocket cannot easily accommodate Tyr or Trp side
chains. Thus, for DRB1*1501, the sequence motif at position P1 may be restricted to residues
chosen from V, L, I, A, M and F.

Similarly, in light of the present disclosure, one of ordinary skill in the art may consider
each of the MHC class II binding pockets, or only selected pockets, and develop a sequence
motif for any MHC class II protein for which the residues involved in pocket formation are
known. These residues will determine both the size and nature of the pocket and, thereby, the
size and nature of the residues which may bind within it. When the pocket is relatively small, the
largest amino acid residues (e.g., Y, W) may be excluded from the corresponding position of the
motif and, when the pocket is charged, amino acid residues of the same charge may be excluded.
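
The two exclusion rules just stated (size and like charge) may be sketched as follows. This is a minimal illustration under stated assumptions: the residue groupings POSITIVE, NEGATIVE and LARGEST and the function apply_pocket_rules are hypothetical names introduced here, and the groupings reflect only the standard one-letter amino acid codes and the examples given in the text.

```python
# Illustrative residue classes (standard one-letter codes). The groupings are
# an assumption of this sketch, not definitions taken from the disclosure.
POSITIVE = set("KR")
NEGATIVE = set("DE")
LARGEST = set("YW")   # the "largest" residues named in the text


def apply_pocket_rules(candidates, pocket_charge=None, pocket_is_small=False):
    """Remove residues disfavoured by a pocket's charge or size.

    candidates:      iterable of one-letter residue codes under consideration
    pocket_charge:   "+", "-" or None (uncharged pocket)
    pocket_is_small: True when the pocket cannot hold the largest side chains
    """
    allowed = set(candidates)
    if pocket_charge == "+":
        allowed -= POSITIVE      # same charge as the pocket is excluded
    elif pocket_charge == "-":
        allowed -= NEGATIVE
    if pocket_is_small:
        allowed -= LARGEST
    return allowed
```

Applied to the aliphatic and aromatic candidates V, L, I, A, M, F, Y and W with pocket_is_small=True, the sketch leaves V, L, I, A, M and F, which agrees with the P1 restriction described above for DRB1*1501.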

If a self or foreign epitope involved in an immune response is known or suspected, and
particularly if its TCR contact residues can be defined through the use of responsive T cell
clones, the TCR contact residues of the epitope may also be considered in developing a sequence
motif. As with the MHC contact residues, all or merely some of the TCR contact residues may
be restricted in the motif. And, as with the MHC positions, the restriction of more positions (or
the greater restriction of any one position) will result in the identification of fewer peptides in a
database search. Unlike the MHC contact residues, for which at least two positions should be
restricted in the motif, it is acceptable to omit any restrictions of TCR contact residues in the
motif.

If any TCR contact residue positions are restricted in the sequence motif, it is preferred
that a position selected from positions P2, P3 and P5 be chosen. Because, in contrast to the
relative promiscuity of MHC class II binding pockets, TCR contact residues appear to have
greater specificity, it is preferred that any TCR contact residue positions which are restricted in
the motif be rather narrowly restricted. That is, it is preferred that such positions be restricted to
just the residue found at the corresponding position of the known antigen or just to residues
which are highly similar in structure and charge.

For example, and as further detailed below, the MBP (85-99) peptide is known to be the
autoantigen implicated in multiple sclerosis. The P3 residue of this peptide is Phe (residue 91 of
MBP) and the P5 residue is Lys (residue 93 of MBP). Thus, if P3 is restricted in the motif, it is
preferred that it be restricted to F or, possibly, to F and Y. Similarly, if P5 is restricted, it is
preferred that it be restricted to only K or to K and the similarly charged R. Alternatively, P3
and P5 may be left unrestricted.
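
The narrow restriction of TCR contact positions may be illustrated as follows. The sketch assumes a small table of conservative substitutions containing only the two pairings named in the text (F with Y, K with R); the names SIMILAR and tcr_position_choices are hypothetical.

```python
# Conservative substitution groups. Only the pairings named in the text are
# listed; any further groupings would be an additional assumption.
SIMILAR = {"F": "FY", "K": "KR"}


def tcr_position_choices(antigen_residue, allow_similar=False):
    """Residues permitted at a restricted TCR contact position.

    With allow_similar=False only the antigen's own residue is permitted;
    otherwise residues listed as similar in structure and charge are added.
    """
    if allow_similar:
        return set(SIMILAR.get(antigen_residue, antigen_residue))
    return {antigen_residue}
```

For the MBP (85-99) example, P3 would thus be restricted to F, or to F and Y, and P5 to K, or to K and R. A position that is to be left unrestricted is simply omitted from the motif.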

Obviously, MHC and TCR positions not selected for restriction may be represented, in
the notation of this disclosure, by an X. Similarly, as shown in the examples below, several motifs
may be developed with varying numbers of positions restricted to varying extents.
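
For use with pattern-matching software, a motif written in this notation may be converted to a regular expression. The following is a sketch only; the dictionary layout (position number mapped to permitted residues, with P1 as position 1) and the name motif_to_regex are assumptions made for illustration.

```python
def motif_to_regex(motif, length):
    """Render a motif as a regular expression string.

    motif:  dict mapping a position number (P1 -> 1, P2 -> 2, ...) to the
            residues permitted there; positions absent from the dict are
            unrestricted (an X in the notation of the text).
    length: number of consecutive positions, starting at P1, to render.
    """
    parts = []
    for position in range(1, length + 1):
        allowed = motif.get(position)
        if allowed is None:
            parts.append(".")   # X: any residue
        else:
            parts.append("[" + "".join(sorted(allowed)) + "]")
    return "".join(parts)
```

A motif restricting only P1 and P4 over six positions would be rendered as two bracketed residue classes separated by two unrestricted positions and followed by two more.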

II. Identifying Self Epitopes Using Sequence Motifs 

An ever increasing number of autoimmune diseases are now being associated with
particular alleles of the MHC class II HLA-DR locus. For most of these autoimmune diseases,
the self epitope remains unknown. For some, however, a self protein involved in the autoimmune
response is known or suspected.

In one aspect of the present invention, a method is provided for identifying the self
epitopes involved in autoimmune diseases associated with MHC class II alleles. That is, by
comparing human peptide sequences to the sequence motifs of the present invention, one is
enabled to identify those peptides which have the highest likelihood of being the self epitopes
involved in the disease.

The method may be applied to any of the autoimmune diseases for which an association
with a particular HLA-DR allele is known and for which the amino acid residues forming the
MHC binding pockets (or at least two of the major pockets) for that allele are known. In
accordance with the method discussed herein, one may then develop one or more sequence
motifs for the HLA-DR protein associated with the disease. Naturally, if the disease is
associated with two or more alleles, motifs may be developed for the two or more HLA-DR
proteins and, in particular, a consensus motif may be developed using the residues shared at
each position of these motifs.
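
One way of forming such a consensus is to intersect, position by position, the residue sets of the individual motifs. The sketch below is illustrative only; it assumes that each motif is held as a dictionary from position label to permitted residues and that a position absent from a motif is unrestricted in that motif. The name consensus_motif is hypothetical.

```python
def consensus_motif(motifs):
    """Combine motifs by keeping, at each position, only the shared residues.

    Each motif maps a position label (e.g. "P1") to its permitted residues.
    A position missing from a motif is treated as unrestricted there, so it
    does not narrow the consensus at that position.
    """
    consensus = {}
    for motif in motifs:
        for position, residues in motif.items():
            residues = set(residues)
            if position in consensus:
                consensus[position] &= residues
            else:
                consensus[position] = residues
    return consensus
```

An empty set at any position of the result would indicate that the alleles share no permitted residue there, in which case separate motifs for each allele may be the more useful choice.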

The sequence motif or motifs thus developed are then compared to appropriate sets of
human peptide sequences. The human peptide sequences may include all known human
sequences or may be limited in ways which will be obvious to one of ordinary skill in the art.
For example, if the disease is restricted to particular tissues, the search may be limited to
peptides found in those tissues. Conversely, peptides which are also found in unaffected tissues
may be eliminated from the search pool. In the most extreme case, when the autoantigen is
known or suspected but the particular epitope is unknown, the search may be limited to
sequences within the autoantigen (see Example 1).

This method may be used to identify a set of peptides which match the motif and which are
most likely to be self epitopes. By varying the number of positions restricted by the motifs,
and/or the extent of restriction at each position, and/or the size of the search pool, the number of
peptides in the set will, in all likelihood, also be varied. As noted above, at least two of the MHC
contact positions (e.g., P1 and P4) should be restricted. Depending upon the number of peptides
in the resultant set, a more or less restrictive motif may then be employed to reduce or expand the
set. The desired size of the resultant set depends, of course, upon the subsequent intentions of
the practitioner of this method.
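
The comparison of a motif to a protein sequence may be sketched with a regular expression search. The fragment below is an assumption-laden illustration, not the search program used in the examples: the function name find_motif_matches is hypothetical, the motif is supplied as a regular expression string, and a lookahead is used so that overlapping matches are not missed.

```python
import re


def find_motif_matches(sequence, pattern):
    """Return (1-based start position, matched segment) for every match.

    A zero-width lookahead wraps the pattern so that overlapping matches
    are all reported rather than only the leftmost of each overlapping run.
    """
    lookahead = re.compile(f"(?=({pattern}))")
    return [(m.start() + 1, m.group(1)) for m in lookahead.finditer(sequence)]
```

Running the same sequence against a pattern with one position fewer restricted will return at least as many matches, which is the effect described above of relaxing a motif to expand the resultant set.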



WO 96/27387 PCT/US96/03182

Once a set of peptides has been identified, these peptides may optionally be screened for 
activity. The choice of such screens is at the discretion of the practitioner and beyond the scope 
of the present invention. Preferred screens, however, include in vitro tests for the ability to 
induce the proliferation of autoreactive T cells or to induce the secretion of lymphokines 
(cytokines) from these T cells or to induce other effector functions such as cytotoxicity. In some 
circumstances, human in vivo tests may be appropriate and in other circumstances animal models 
of the human disease may be available. 



III. Identifying Foreign Epitopes Implicated in Human Autoimmune Disease

As noted in the Background section, epidemiological evidence has suggested that various 
bacterial and viral pathogens may be implicated in human autoimmune diseases and the concept 
of molecular mimicry pervades the literature (reviewed by Oldstone, 1990). Prior attempts to
identify particular foreign epitopes involved in human autoimmune disease, however, have 
depended upon direct sequence similarity to known human epitopes. The results have been 
disappointing and, to date, no pathogens or peptides derived from pathogens have been shown to 
be a primary cause of human autoimmune disease. 

Thus, in another aspect of the present invention, a method of identifying foreign epitopes
implicated in human autoimmune disease is provided. That is, for the first time a method of
identifying such foreign epitopes is provided which employs sequence motifs to identify foreign
peptides which have the highest likelihood of being involved in the aetiology of human
autoimmune disease.

The method may be applied to any of the autoimmune diseases for which an association
with a particular MHC class II protein is known and for which either (1) a sequence motif has
been developed by prior art methods or (2) a sequence motif may be developed by the methods
of the present invention. When the self epitope is known or suspected, TCR contact residues
may be included in the motif. As before, one or more motifs may be employed and differently
derived motifs may be combined to develop consensus motifs.

The sequence motif or motifs thus developed may then be compared to appropriate sets of
peptide sequences derived from human pathogens. This is most conveniently accomplished
using genetic databases widely available to those of skill in the art. In a most preferred
embodiment, the search pool is limited in one or more of the following ways: (1) only sequences
from human bacterial or viral pathogens are included; (2) sequences from the normal human
intestinal flora (e.g., E. coli or other Enterobacteriaceae) are excluded; and (3) sequences from
pathogens are included/excluded depending upon whether the geographical or epidemiological
incidence of the pathogens is positively/negatively correlated with the incidence of the
autoimmune disease in question (see Example 2).

This method may be used to identify a set of foreign peptides which match the motif and
which are most likely to be involved in the human disease. As before, the number of peptides in
the set can be varied by using more or less restrictive motifs and/or by varying the search pool.
And, as before, the resultant set of peptides may subsequently be subjected to any of a variety of
known screens for activity.
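
The three limitations on the search pool may be sketched as a simple filter over source organisms. The record type SourceOrganism and its field names are hypothetical and do not correspond to any particular database; the sketch also applies all three limitations together, whereas the text permits any one or more of them.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SourceOrganism:
    """Hypothetical annotation record for the organism a sequence comes from."""
    name: str
    is_bacterial_or_viral_pathogen: bool
    is_normal_intestinal_flora: bool
    # True/False when incidence is positively/negatively correlated with the
    # disease; None when no geographical or epidemiological data are known.
    correlates_with_disease: Optional[bool] = None


def in_search_pool(organism):
    """Apply limitations (1) to (3) of the text, all at once."""
    if not organism.is_bacterial_or_viral_pathogen:
        return False                      # limitation (1)
    if organism.is_normal_intestinal_flora:
        return False                      # limitation (2)
    if organism.correlates_with_disease is False:
        return False                      # limitation (3)
    return True
```

Organisms for which no correlation data are available are retained in this sketch; a stricter practitioner might instead require a positive correlation.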


IV. Self and Foreign Epitopes Identified by the Methods of the Present Invention

As detailed in the examples below, the methods of the present invention have been
employed (1) to identify seven self epitopes of the desmoglein 3 protein implicated in pemphigus
vulgaris and (2) to identify eight foreign epitopes from human pathogens implicated in multiple
sclerosis.

Each of these peptides is fifteen residues in length, partly as a result of the computer
database search program used (Genetics Computer Group program "Findpatterns") but also
corresponding to the size of the cleft in MHC class II molecules. The fifth position of each
corresponds to the P1 residue of the antigen. Thus, the P-2 to P11 residues which span the MHC
class II binding cleft correspond to the third through fifteenth residues of these sequences. The
P-1 to P9 residues which are important to MHC and TCR binding correspond to the fourth
through thirteenth positions. The most important residues for MHC and TCR binding, P-1 to P6,
correspond to the fourth through tenth positions of these sequences.
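
The correspondence between position labels and the residues of a fifteen-residue sequence may be sketched as follows. The function names index_of and fragment are hypothetical; the sketch assumes, as the numbering above implies, that there is no position P0, so that P-1 immediately precedes P1.

```python
P1_INDEX = 4  # the fifth residue (0-based index 4) is P1


def index_of(p):
    """0-based index within the fifteen-residue sequence of position P<p>."""
    if p == 0:
        raise ValueError("there is no P0; P-1 is followed directly by P1")
    return P1_INDEX + (p - 1 if p > 0 else p)


def fragment(peptide, first, last):
    """Return the residues from P<first> through P<last>, inclusive."""
    if len(peptide) != 15:
        raise ValueError("expected a fifteen-residue sequence")
    return peptide[index_of(first):index_of(last) + 1]
```

Thus fragment(peptide, -2, 11) returns the third through fifteenth residues, fragment(peptide, -1, 9) the fourth through thirteenth, and fragment(peptide, -1, 6) the fourth through tenth.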

SEQ ID NO.: 1 through SEQ ID NO.: 7, also shown in Table 1 as PVA.1 through PVA.7,
correspond to residues 78-93, 97-111, 190-204, 206-220, 251-265, 512-526 and 762-786 of the
human desmoglein 3 protein. These peptides are implicated as self epitopes in pemphigus
vulgaris. Already, as described in Example 1, two of these peptides have been shown to cause
proliferation of T cells isolated from two patients with pemphigus.

SEQ ID NO.: 8 through SEQ ID NO.: 15, also shown in Table 2, are internal fragments of
the herpes simplex virus UL15 protein, adenovirus Type 12 ORF, Pseudomonas aeruginosa
phosphomannomutase, human papillomavirus Type 7 L2 protein, Epstein-Barr virus DNA
polymerase, Influenza Type A hemagglutinin protein, Reovirus Type 3 sigma 2 protein, and
herpes simplex DNA polymerase, respectively. These peptides are implicated as foreign
epitopes involved in the aetiology or in remissions of multiple sclerosis. As detailed in Example
2 below, each has been shown to be capable of inducing the proliferation of autoreactive T cell
clones isolated from human multiple sclerosis patients. The sequence of the MBP (85-99)
peptide is disclosed as SEQ ID NO.: 16.

Each of these peptides has a variety of utilities and, therefore, in another aspect, the present
invention provides each of these peptides in isolated form. In addition to the sequences of fifteen
residues shown in the Sequence Listing and Tables, the present invention also embraces the
fragments of these peptides corresponding to the MHC binding domain. In particular, the
invention provides peptides corresponding to the P-2 to P11, P-1 to P9 and P-1 to P6 positions of
each of SEQ ID NO.: 1 through SEQ ID NO.: 15. As will be obvious to one of ordinary skill in
the art, however, any fragment of any of these peptides which includes at least the P1 and P4 or
at least the P1 and P6 or at least the P4, P6 and P7 residues may have utility and is intended to
fall within the spirit and scope of the claims. In particular, longer peptides including these
peptides or peptides including at least the MHC binding and TCR contact residues described
above are contemplated as equivalents.

The manner of production of these peptides is inconsequential but they may be isolated
and purified from their natural sources or they may be synthesized. Because of their relatively
short length, it is presently contemplated that they should be produced by synthesis. Methods of
isolation, purification and synthesis of such peptides are well known in the art and need not be
recited herein.

The present invention also provides products and methods using other peptides which may 
be identified by the methods of the present invention. These peptides, as well as those disclosed 
above, may be used in each of the following embodiments. 

The peptides of the present invention may be used for in vitro assays to aid in the diagnosis
and classification of pemphigus vulgaris and multiple sclerosis. For example, autoreactive T
cells from patients with PV and MS may be tested, as in the examples below or by other known
assays, for reactivity with these peptides. The ability or inability of these peptides to cause
proliferation of the T cells will, in the case of pemphigus, allow for a refinement of the diagnosis
by particular desmoglein 3 epitopes and, in the case of multiple sclerosis, allow further
classification of the disease by cross-reactive (self and foreign epitope) types. Immune response
to these peptides prior to onset of disease may also be used as an indication of susceptibility or
predisposition although care should be taken so as not to induce autoantigenic response.

The peptides of the present invention may also be used in the development of animal
models by immunizing animals (e.g., mice, rabbits, non-human primates) with these peptides.
Animals which not only develop a response to the peptide but which also develop an
autoimmune disease corresponding to the human pathology will have obvious utility as models
for the human disease. Animals which develop a response to the peptide without developing any
corresponding autoimmune disease will have utility as subjects for experiments involving the
selective depletion of T cells or other forms of desensitization or tolerization.

Importantly, these peptides and amino acid analogs of these peptides will have utility as 
therapeutic and diagnostic agents. The pathogens, viruses or bacteria from which they are 
derived will have utility as vaccinating agents. Some examples of the utility of these materials 
include the following. 

The peptide may be administered in high doses to produce high dose tolerance. This
process of tolerization is described in, for example, PCT patent application US93/08456
(International publication number WO 94/06828). Thus, in one set of embodiments, the
invention provides for pharmaceutical preparations for use in tolerizing individuals to
autoantigens. The preparations may include a pharmaceutically acceptable carrier and an
isolated human polypeptide which includes an amino acid sequence corresponding to a sequence 
motif for a MHC class II protein, such as an HLA-DR protein, which is associated with a human 
autoimmune disease. These polypeptides are capable of binding to the protein to form a complex 
which activates autoreactive T cells in subjects having the autoimmune disease. By using the 
peptides disclosed herein or identified by the methods of the invention, such pharmaceuticals can 
be used to combat autoimmune response. The use of such tolerization to human autoimmune 
diseases is known in the art and need not be elaborated upon here. Tolerizing doses of collagen 
for rheumatoid arthritis and myelin basic protein for multiple sclerosis have been used. The 
present invention therefore specifically does not embrace these proteins. Other peptides, 
however, may now be identified by the present methods and be similarly used to treat 
autoimmune disease. 

In particular embodiments, such pharmaceutical preparations are provided in which the
protein is HLA-DR4 or HLA-DQ1 protein and the autoimmune disease is pemphigus vulgaris.
In addition, using PV motif #1, pharmaceuticals having peptides with this motif are provided. In
most preferred embodiments, the pharmaceuticals include at least one of the polypeptides of
SEQ ID NO.: 1 through SEQ ID NO.: 7. Thus, methods of tolerizing an individual to a
pemphigus vulgaris autoantigen are also provided.

In a similar set of embodiments, the invention provides for pharmaceutical preparations for
use in tolerizing individuals to antigens of human pathogens which are implicated in human
autoimmune disease. The preparations include a pharmaceutically acceptable carrier and an
isolated human pathogen polypeptide which includes an amino acid sequence corresponding to a
sequence motif for a MHC class II protein, such as an HLA-DR protein, which is associated with
a human autoimmune disease. These polypeptides are capable of binding to the protein to form a
complex which activates autoreactive T cells in subjects having the autoimmune disease. Thus,
by tolerizing an individual to these antigens, T cells which are cross-reactive with the self
antigen will be rendered unresponsive or anergized and protection from the disease will be
afforded.

In particular embodiments, such pharmaceutical preparations are provided in which the
protein is HLA-DR2 protein and the autoimmune disease is multiple sclerosis. In addition, using
the three MS motifs disclosed herein, pharmaceuticals having peptides with at least one of
these motifs are provided. Specific embodiments of the pharmaceuticals include at least one of
the polypeptides disclosed as SEQ ID NO.: 8 through SEQ ID NO.: 15. Thus, methods of
tolerizing an individual to a multiple sclerosis foreign antigen are also provided.

In another set of embodiments, pharmaceuticals are provided for vaccination against a human
pathogen implicated in the aetiology of autoimmune disease. These pharmaceutical preparations
include a pharmaceutically acceptable carrier and an immunogenic preparation effective to
immunize against a human pathogen. The human pathogen is one which in its native form
includes a polypeptide having an amino acid sequence corresponding to a sequence motif for a
MHC class II protein, such as an HLA-DR protein, which is associated with the autoimmune
disease. These polypeptides are capable of binding to the protein to form a complex which
activates T cells which become autoreactive and initiate the autoimmune disease. The
preparations of the present invention specifically do not include such polypeptides but, rather,
include other antigens from the pathogen. That is, a vaccine is produced which specifically does
not include polypeptides which correspond to the sequence motifs for the protein and, if known,
the TCR contacts of the self epitope. Because pathogens present a wide array of antigenic
determinants, one may eliminate those which correspond to the relevant sequence motif and
produce a vaccine which is effective against the pathogen but which will not include peptides
implicated in the autoimmune response.

Such vaccines, lacking peptides corresponding to the sequence motifs of the present
invention, may be made in any convenient manner by one of ordinary skill in the art. For
example, when producing an influenza vaccine, one may compare the peptide sequences of the
influenza virus to sequence motifs developed according to the present invention. The vaccine may
then be made excluding the proteins which have the motif sequences (e.g., by using a fragment of
the virus) or recombinant techniques may be used to produce a virus in which the sequences
corresponding to the motif are altered such that they do not match the motif. In preferred
embodiments, the altered residues are the TCR contact residues and, in particular, a substitution
which alters the charge of a TCR contact residue is preferred. Similar vaccines can be developed
for bacterial pathogens using, for example, only portions of the bacterium (e.g., bacterial surface
proteins or membrane associated proteins) which lack a motif associated with human
autoimmune disease or, again, genetically altering the vaccine bacteria so as to change residues.

The motifs which are considered in creating such vaccines may be chosen on any of
several grounds. If the pathogen to which the vaccine is directed is associated with an
autoimmune disease, a motif may be developed according to the methods described herein for an
HLA-DR protein associated with the disease. If there is also a known or suspected autoantigen,
the motif may include TCR contact residues of the self epitope. The pathogen's protein
complement may then be compared to the motif and peptides corresponding to the motif may be
omitted from the vaccine or may be altered by recombinant means to produce a vaccine free of
such peptides. Alternatively, vaccines may be developed with particular populations in mind.
For individuals suffering from or at risk of developing a particular autoimmune disease, special
vaccines may be developed. In this case, the motif can be chosen on the basis of the HLA-DR
protein associated with the autoimmune disease and, when known, the TCR contact residues of
the self epitope.

In particular embodiments, such vaccine preparations are provided in which the HLA-DR
protein is HLA-DR4 or HLA-DQ1 protein and the autoimmune disease is pemphigus vulgaris.
And, in particular, vaccines are provided lacking a peptide corresponding to the PV motif #1
disclosed herein. Specific embodiments of the vaccines include vaccines lacking at least one of
the peptides disclosed as SEQ ID NO.: 1 through SEQ ID NO.: 7. Thus, methods of immunizing
an individual against pathogens which may cause pemphigus vulgaris are also provided.

Similarly, vaccine preparations are provided in which the HLA-DR protein is HLA-DR2
protein and the autoimmune disease is multiple sclerosis. And, in particular, vaccines lacking a
peptide corresponding to at least one of the three MS motifs disclosed herein are provided.
Specific embodiments of the vaccines include vaccines lacking at least one of the peptides
disclosed as SEQ ID NO.: 8 through SEQ ID NO.: 15. In particularly preferred embodiments,
the vaccines are developed from the pathogens listed in Table 2 but lack either the entire protein
listed as the antigen or lack at least the peptides identified in the corresponding sequences. Thus,
methods of immunizing an individual against pathogens which may cause multiple sclerosis are
also provided.

These peptides will also be useful in assessing which pathogen(s) may be important in a
particular patient. For example, the T cells from one patient may proliferate in response to one
or a few of these peptides, while those from another patient may proliferate in response to a
different peptide or set of peptides. Analogs of the peptides may be synthesized in which one of
the T cell receptor contact residues is substituted. For example, in the case of MS, peptides with
the substitution of MBP 91F by 91A or of MBP 93K by 93A may be employed. Such analogs are
not, however, limited to the substitution of these primary T cell receptor contact residues or to
substitution by particular amino acids such as A. These peptide analogs may be used to anergize
(inactivate) autoreactive T cells (see, for example, Sloan-Lancaster et al., 1993 and 1994) upon
administration to autoimmune patients. The viruses or bacterial pathogens may be useful in
immunization by selecting viral or bacterial strains that do not carry the mimicry epitope.
Proteins from these pathogens other than those that carry the mimicry epitope may also be
selected for immunization. This therapy may be useful in preventing reinfection and thus
remissions of the disease or in preventing initial infections in particularly susceptible populations
(the most obvious example of which is a disease-free identical twin of a patient).
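
The design of such single-residue analogs may be sketched as a substitution addressed by the residue numbering of the parent protein. The function substitute_residue and its parameters are hypothetical names chosen for illustration; the peptide sequence itself is supplied by the user (for MBP (85-99), from SEQ ID NO.: 16) and is not reproduced here.

```python
def substitute_residue(peptide, first_residue_number, residue_number,
                       expected, replacement):
    """Return an analog of `peptide` with one residue replaced.

    first_residue_number: protein numbering of the peptide's first residue
                          (85 for a peptide spanning residues 85-99)
    residue_number:       protein numbering of the residue to replace
    expected:             residue that should currently occupy that position,
                          checked to guard against numbering mistakes
    replacement:          one-letter code of the substituting residue
    """
    index = residue_number - first_residue_number
    if not 0 <= index < len(peptide):
        raise ValueError("residue number lies outside the peptide")
    if peptide[index] != expected:
        raise ValueError(
            f"expected {expected} at residue {residue_number}, "
            f"found {peptide[index]}"
        )
    return peptide[:index] + replacement + peptide[index + 1:]
```

For the analogs named above, the call would take residue number 91 with expected residue F, or residue number 93 with expected residue K, and replacement A in each case.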


EXAMPLES 

I. Identification of Self Epitopes of Pemphigus Vulgaris 

As noted above, pemphigus vulgaris (PV) is, in different ethnic groups, associated either
with a DR4 allele (DRB1*0402) or with a rare DQ1 allele (DQB1*05032); only a small fraction
of PV patients have neither susceptibility gene (Ahmed et al., 1991; Ahmed et al., 1990; Scharf
et al., 1988b). The PV associated molecule has a negative charge (Glu) at the critical
position β71; the neighboring position (β70) is also negatively
charged. The DR4 subtype associated with PV is the only one that carries a negative charge at
DRβ71 (a positive charge (Arg) is found at DRβ71 in the RA associated DR4 molecules).
Although polymorphic, the P7 pocket residue DRβ67 (Leu/Ile) does not appear to be involved in
peptide binding but probably acts as a TCR contact residue (Stern et al., 1994).

The charge of a polymorphic residue at DRβ71 could therefore account for susceptibility
to two different autoimmune syndromes associated with structurally similar DR4 subtypes: DR4
alleles associated with susceptibility to rheumatoid arthritis have a positive charge at DRβ71
(Arg) while the DR4 allele associated with pemphigus vulgaris has a negative charge at DRβ71
(Glu). Peptides selective for either DR4 molecule may therefore differ significantly in their
charge at P4: peptides with a negative charge at P4 would be expected to bind to the RA
associated molecules but not the pemphigus associated DR4 molecule; in contrast, a positive
charge would be expected for the pemphigus peptide(s) at position 4. Due to the conserved
nature of these molecules, other peptide anchor residues (P1 and P6) would not be expected to be
different for these DR4 subtypes.

A sequence motif for selective binding to the HLA-DR DRB1*0402 protein was
developed according to the method disclosed herein.

The β chain residues involved in forming the P1 pocket for this allele are β85 (Val), β86 (Val),
β89 (Phe) and β90 (Thr). Thus, the presence of Val at β86 (instead of Gly as in DRB1*0101)
suggests that the P1 position of the motif be restricted to V, L, I, M and F. Alanine might also
have been included but was not in this example. The P6 pocket is formed in part by the β11
(Val) and β13 (His) residues of the DRB1*0402 protein. Relative to the DRB1*0101 allele, in which
these residues are Leu and Phe respectively, the P6 pocket of the DRB1*0402 protein is
somewhat larger and more polar. Thus, for the P6 position of the motif, S, T, N and V were
allowed. Finally, the P4 pocket of this DR protein is formed in part by the residues β13 (His),
β70 (Asp), β71 (Glu), β74 (Ala) and β78 (Tyr). As noted above, the two negatively charged
residues at β70 and β71 create a preference for positively charged antigen residues and,
therefore, the motif for the P4 position was restricted to K and R.
Thus, the sequence motif for the pemphigus vulgaris autoantigen was defined as:



Position:      P1   P2   P3   P4   P5   P6

PV Motif #1:   V    X    X    K    X    S
               L              R         T
               I                        N
               M                        V
               F

Although the autoantigen for pemphigus vulgaris is known, the precise epitopes within the
autoantigen have previously remained unknown. Using the method of the present invention,
however, it has been possible to identify a small set of peptides which may serve as the
autoantigenic determinants. The target antigen of pemphigus vulgaris is an epithelial adhesion
molecule of the cadherin family, desmoglein 3 (Amagai et al., 1991). Desmoglein 3 mediates
Ca²⁺ dependent adhesion between keratinocytes; the autoantibodies interfere with cell adhesion
with resulting blister formation (Takeichi, 1990). The autoantibodies are thought to be
pathogenic since a transient blistering disease is also seen in newborns of affected mothers due to
transfer of maternal immunoglobulin to the fetus. Transfer of serum or desmoglein 3 specific
antibodies to mice also results in acantholysis (Amagai et al., 1992).

Only seven peptides from this large protein (130 kDa, 999 amino acids) matched the motif.
These seven pemphigus vulgaris antigens (PVA.1-PVA.7) are presented below in Table 1 with
the residues corresponding to the MHC binding positions P1, P4 and P6 underlined.





TABLE 1

PVA.1   (res. 78-93)     ATQKITYRISGVGID
PVA.2   (res. 97-111)    FGIFVVDKNTGDINI
PVA.3   (res. 190-204)   LNSKIAFKIVSQEPA
PVA.4   (res. 206-220)   TPMFLLSRNTGEVRT
PVA.5   (res. 251-265)   CECNIKVKDVNDNFP
PVA.6   (res. 512-526)   SARTLNNRYTGPYTF
PVA.7   (res. 762-786)   QSGTMRTRHSTGGTN
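
By way of illustration only, PV Motif #1 may be written as a regular expression and scanned over each candidate peptide. The following is a minimal sketch and not part of the disclosed method: the function name `find_motif`, the dictionary `PVA_PEPTIDES` and the use of Python's `re` module are assumptions made for this example, and the peptide sequences are the one-letter equivalents of SEQ ID NOS: 1-7.

```python
import re

# PV Motif #1: P1 in {V,L,I,M,F}, P4 in {K,R}, P6 in {S,T,N,V};
# P2, P3 and P5 are unrestricted (X). A lookahead is used so that
# overlapping occurrences would all be reported.
PV_MOTIF_1 = re.compile(r"(?=([VLIMF]..[KR].[STNV]))")

# Peptides PVA.1-PVA.7 in one-letter code (per SEQ ID NOS: 1-7).
PVA_PEPTIDES = {
    "PVA.1": "ATQKITYRISGVGID",
    "PVA.2": "FGIFVVDKNTGDINI",
    "PVA.3": "LNSKIAFKIVSQEPA",
    "PVA.4": "TPMFLLSRNTGEVRT",
    "PVA.5": "CECNIKVKDVNDNFP",
    "PVA.6": "SARTLNNRYTGPYTF",
    "PVA.7": "QSGTMRTRHSTGGTN",
}


def find_motif(sequence, pattern=PV_MOTIF_1):
    """Return (0-based offset of P1, matched P1-P6 segment) for every match."""
    return [(m.start(), m.group(1)) for m in pattern.finditer(sequence)]


if __name__ == "__main__":
    for name, seq in PVA_PEPTIDES.items():
        print(name, find_motif(seq))
```

In this sketch each of the seven peptides yields a single match, with P1 at the fifth residue of the 15-mer. The same function could be applied to a full-length protein sequence to enumerate candidate segments.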



Therefore, the selective presentation of one or several of these peptides by the PV
associated DRB1*0402 molecule to T cells may be critical for initiating autoimmunity in PV.

To test this, T cell lines were raised from blood mononuclear cells of two patients with active
disease by stimulation with the seven candidate peptides. T cell lines were expanded with rIL-2
and tested for recognition of the candidate peptides in a proliferation assay. T cell lines from
both patients recognized two peptides from the extracellular domain of desmoglein 3 (PVA.3 and
PVA.4) that were located close to the major autoantibody recognition site. These T cell lines
were HLA-DR restricted as T cell proliferation was blocked by a monoclonal antibody specific
for HLA-DR but not by a control antibody. These desmoglein 3 peptides are therefore
candidates for the T cell dependent induction of autoimmunity in pemphigus vulgaris.

The motifs of the present invention relating to the HLA-DR proteins can be taken one step
further. In a different ethnic group PV is associated with a rare DQ1 subtype (DQB1*05032)
that differs from the common DQ1 subtype only at position 57 of the DQβ chain (Sinha et al.,
1988). In the PV associated molecule DQβ 57 is negatively charged (Asp) whereas in the
common DQ1 subtype it is not. The same position on the DQβ chain has also been implicated in
susceptibility to diabetes. In diabetes, however, the reverse is true: DQ2 and DQ8 molecules
associated with susceptibility to diabetes do not have a negative charge at DQβ 57 (Todd et al.,
1987).

Based on these observations it becomes clear that two polymorphic positions in the MHC
class II β chain (position 71 of DRβ and position 57 of DQβ) are critical for selective peptide
binding and the development of autoimmunity. Based on the criteria described above, a diabetes
linked peptide would be expected to have a negative charge at P9 since such a peptide would
only bind to DQ molecules that do not have the same charge at DQβ 57. In contrast, for the DQ1
associated cases of pemphigus, a peptide with a positive charge at P9 may be selective for the
disease associated molecule which carries a negative charge at DQβ 57. In the case of DR4
linked autoimmunity, the charge at peptide position 4 confers selectivity to the disease associated
DR4 molecule: RA peptides have a negative charge at P4, PV peptides a positive charge at P4.
Motifs for selective peptide binding may therefore prove to be tremendously useful in the
identification of key epitopes that initiate human autoimmune diseases. This approach is
expected to be useful not only for identifying peptides in PV, RA or diabetes but also for other
autoimmune diseases where residues critical in peptide binding have been linked to disease
susceptibility.
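
The charge reasoning above can be summarized in a deliberately simplified sketch. It assumes that only Asp/Glu (negative) and Lys/Arg (positive) are charged, that opposite charges favor binding and like charges disfavor it, and that a peptide residue is "selective" when it interacts more favorably with the disease associated pocket residue than with the alternative one. The function names and the use of Ala as a stand-in for an uncharged DQβ 57 residue are assumptions of this example and are not taken from the specification.

```python
# Simplified side-chain charges; every other residue is treated as neutral.
CHARGE = {"D": -1, "E": -1, "K": +1, "R": +1}


def charge(residue):
    return CHARGE.get(residue, 0)


def interaction(pocket_residue, peptide_residue):
    """+1 for opposite charges, -1 for like charges, 0 if either is neutral."""
    return -charge(pocket_residue) * charge(peptide_residue)


def is_selective(peptide_residue, disease_pocket, other_pocket):
    """True if the peptide residue is favored by the disease associated
    pocket residue relative to the alternative pocket residue."""
    return (interaction(disease_pocket, peptide_residue)
            > interaction(other_pocket, peptide_residue))


if __name__ == "__main__":
    # DR4, P4 pocket: DRβ 71 is Glu in the PV allele and Arg in the RA alleles.
    print("PV, Lys at P4:", is_selective("K", "E", "R"))
    print("RA, Asp at P4:", is_selective("D", "R", "E"))
    # DQ, P9 pocket: "A" is an assumed neutral placeholder for DQβ 57.
    print("Diabetes, Asp at P9:", is_selective("D", "A", "D"))
    print("PV (DQ1), Lys at P9:", is_selective("K", "D", "A"))
```

Such a rule is only a qualitative summary; actual binding depends on the full pocket environment described in the preceding sections.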

2. Identification of Mimics of the Multiple Sclerosis MBP Autoantigen

Susceptibility to MS is associated with HLA-DR2 (DRA, DRB1*1501, the most common
subtype of DR2) (Spielman et al., 1982; Olerup et al., 1989). This MHC class II molecule is
believed to play a critical role in the immunopathogenesis of MS by presenting immunodominant
self peptides to autoreactive T cells. Following injection of MBP in experimental animals, T
cells specific for immunodominant peptides of MBP mediate an inflammatory response in the
CNS that can be accompanied by marked demyelination (reviewed in Zamvil and Steinman,
1990). In previous studies two regions of human MBP were found to be immunodominant
(residues 84-102 and 143-168) (Ota et al., 1990; Pette et al., 1990; Martin et al., 1990;
Wucherpfennig et al., 1994a). Reactivity to the MBP(84-102) peptide was predominantly seen in
subjects carrying HLA-DR2. Using L cell transfectants as antigen presenting cells, HLA-DR2b
(DRA, DRB1*1501) was found to serve as the restriction element for these MBP(84-102)
specific T cell clones.

The MBP(84-102) peptide binds with high affinity to the HLA-DR2b molecule with two
hydrophobic residues serving as the anchors P1 (Val 89 of MBP) and P4 (Phe 92 of MBP)
(Wucherpfennig et al., 1994a; Vogt et al., 1994). At position P1, Val 89 can be substituted by
other aliphatic amino acids (Leu, Ile), as well as by methionine and phenylalanine; alanine is
tolerated at this position but reduces the affinity of the peptide for HLA-DR2b. At position P4,
all aliphatic and aromatic residues are permitted; again alanine is tolerated but results in a loss of
binding affinity.

A mutational analysis of putative TCR contact points confirms that P3 (Phe 91) and P5
(Lys 93) are TCR contacts for the MBP(85-99) specific clones; other residues such as P-1 (Val
88) and P2 (His 90) are important for some clones but not for others. Substitution of P3 (Phe 91)
by alanine abolishes TCR recognition for all clones; some clones tolerate conservative
substitutions (e.g., tyrosine or aliphatic amino acids) while other clones do not. Substitution of P5
(Lys 93) by arginine is tolerated by most T cell clones but more drastic changes frequently result
in a partial or complete loss of T cell reactivity. This analysis demonstrates that P2 (His 90), P3
(Phe 91) and P5 (Lys 93) are the primary TCR contact residues for MBP while P1 (Val 89) and
P4 (Phe 92) are the primary MHC contact residues for MBP. This analysis also confirms the
present teaching that the TCR contact residues of a motif should be highly if not absolutely
conserved.

Based on this structural characterization of the immunodominant MBP(85-99) peptide,
three sequence motifs were developed to search a protein database for viral and bacterial peptides
that matched these requirements. The motifs focused on the core region of the peptide, residues
P-1 to P5 (88-93 of the MBP protein), which contained the MHC and TCR contacts common to
all clones. In the first motif, aliphatic amino acids were allowed at the first MHC anchor residue
P1 while both aliphatic and aromatic residues were permitted at the second MHC anchor P4. For
the TCR contacts, the Phe 91 at P3 was absolutely conserved, the Lys 93 at P5 could be
substituted only by arginine, while the His 90 at P2 and the Val 88 at P-1 could be substituted by
several structurally related amino acids. Thus, the first motif for multiple sclerosis antigens was
defined as:

Position:      P-1  P1   P2   P3   P4   P5

MS Motif #1:   V    V    H    F    F    K
               L    L    N         Y    R
               I    I    Q         W
               A    A    F         V
               M    M              L
                                   I
                                   A
                                   M

The second motif omitted the Val 88 at P-1 as a TCR contact residue (used only by some
clones) and also permitted aromatic amino acids at the first MHC anchor P1 (Val 89). This was
done since the MBP(85-99) peptide is presented by different HLA-DR2 subtypes: presentation
by DRB1*1501 requires an aliphatic amino acid or phenylalanine at this position while aliphatic
and all aromatic residues at this position can serve as anchor for DRB1*1602. This difference, as
explained above, relates to the size of the primary pocket for binding this hydrophobic residue
and is determined by the Val/Gly dimorphism at DRβ86 (Val in *1501 and Gly in *1602) (Busch
et al., 1991). Thus, the second motif for multiple sclerosis antigens was defined as:

Position:      P1   P2   P3   P4   P5

MS Motif #2:   F    H    F    F    K
               Y    N         Y    R
               W    Q         W
               V    F         V
               L              L
               I              I
               A              A
               M              M

The third sequence motif represented a modification of the TCR contact residues preferred
by a subgroup of MBP(85-99) specific clones. For these clones, P5 (Lys 93) was absolutely
conserved while P3 (Phe 91) could be substituted by some aromatic or aliphatic amino acids.
The third motif for multiple sclerosis antigens was defined as:

Position:      P1   P2   P3   P4   P5

MS Motif #3:   V    H    F    F    K
               L    F    Y    Y
               I    Y    W    W
               A    W    V    V
               M         L    L
                         I    I

WO 96/27387 PCT/US96/03182

These HLA-DR motifs also matched well with the structural requirements for an
HLA-DQ1 restricted clone specific for the MBP(85-99) peptide. This clone required the same
minimal peptide segment as DR2 restricted clones (residues 87-97). As in the DR2 restricted
clones, P2 (His 90), P3 (Phe 91) and P5 (Lys 93) appeared to be the primary TCR contact
residues. Substitution of these hydrophobic positions by aspartic acid greatly diminished the
stimulatory capacity of the peptide while substitutions by other hydrophobic amino acids were
tolerated. These data suggest that the MBP(85-99) peptide is bound in a similar fashion to
HLA-DR2b and to HLA-DQ1 and that the same peptide residues are critical for interaction with
the TCR.

These motifs were used as search criteria in a search of protein databases (PIR and
SwissProt) using the Genetics Computer Group software (program: findpatterns). More than 600
sequences of viral and bacterial origin were identified that matched these criteria. From this set,
sequences were selected based on the following criteria: (1) viruses known to cause human
pathology; (2) viruses prevalent in the Northern Hemisphere where MS occurs most frequently;
(3) selected bacterial sequences associated with inflammatory CNS disease (such as Borrelia
burgdorferi) and with invasive infections (such as Staphylococcus aureus, Klebsiella pneumoniae
and Pseudomonas aeruginosa). Not included were most viruses that cause infections in tropical
countries, sequences derived from vaccinia virus and a large number of sequences from E. coli
(which is part of the normal intestinal flora). When multiple antigenic variants were present, one
or several sequences that best fit the motifs were chosen. The selected peptides were synthesized
by Pin-Technology on a 1 mg scale (Chiron Mimotopes, San Diego). Seventy peptides were
made which fit motifs #1 and #2 and 59 peptides for motif #3.
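
A pattern search of this kind can be sketched with ordinary regular expressions. The example below is not the findpatterns program and does not reproduce the reported database search; it is a minimal illustration in which the per-position residue sets are a reading of the three MS motif tables in this section (and should be checked against them), and the names `MS_MOTIFS`, `motif_to_regex` and `search` are assumptions of this example. The two test sequences are taken from Table 2.

```python
import re

# Allowed residues per position, as read from the three MS motif tables.
MS_MOTIFS = {
    "MS Motif #1": ["VLIAM", "VLIAM", "HNQF", "F", "FYWVLIAM", "KR"],  # P-1 to P5
    "MS Motif #2": ["FYWVLIAM", "HNQF", "F", "FYWVLIAM", "KR"],        # P1 to P5
    "MS Motif #3": ["VLIAM", "HFYW", "FYWVLI", "FYWVLI", "K"],         # P1 to P5
}


def motif_to_regex(allowed_per_position):
    """Build a regex with one character class per motif position.
    The lookahead allows overlapping matches to be reported."""
    body = "".join(f"[{residues}]" for residues in allowed_per_position)
    return re.compile(f"(?=({body}))")


def search(sequences, motifs=MS_MOTIFS):
    """Return (motif name, sequence name, 0-based offset, matched segment)."""
    hits = []
    for motif_name, positions in motifs.items():
        pattern = motif_to_regex(positions)
        for seq_name, seq in sequences.items():
            for m in pattern.finditer(seq):
                hits.append((motif_name, seq_name, m.start(), m.group(1)))
    return hits


if __name__ == "__main__":
    examples = {
        "MBP(85-99)": "ENPVVHFFKNIVTPR",
        "EBV DNA polymerase": "TGGVYHFVKKHVHES",
    }
    for hit in search(examples):
        print(hit)
```

With these residue sets the MBP(85-99) peptide satisfies all three motifs, whereas the EBV DNA polymerase peptide satisfies only the second, which admits an aromatic residue (Tyr) at P1.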

These peptides were then tested for their ability to activate human MBP(85-99) specific T
cell clones that had been previously established from blood T cells of two patients with
relapsing-remitting MS (Wucherpfennig et al., 1994a; Wucherpfennig et al., 1994b).
Homozygous B cell lines that expressed DR2 (DRB1*1501 or DRB1*1602) or DQ1 were used
as antigen presenting cells (APCs) in these T cell proliferation experiments (Wucherpfennig et
al., 1994a). As a positive control, all clones were shown to be activated by the MBP(85-99)
peptide. Seven clones were tested with the viral/bacterial peptides selected according to the
sequence motifs.

Three of the seven clones tested were efficiently activated by several viral/bacterial
peptides. The first clone (Hy.1B11) was HLA-DQ1 restricted whereas the other two clones
(Hy.2E11 and Hy.1G11) were HLA-DR2 restricted (Wucherpfennig et al., 1994a). Among the
70 peptides selected according to motifs #1 and #2, three mimicry peptides stimulated the DQ1
restricted clone and two peptides stimulated both of the DR2 restricted T cell clones. Among the
group of 59 peptides selected according to motif #3, one peptide was identified for the DQ1
restricted clone while two were identified for the DR2 restricted clones.

Taken together, the DQ1 restricted T cell clone recognized five structurally related
peptides: the immunodominant MBP(85-99) peptide, three viral peptides (from Herpes simplex,
Adenovirus type 12 and Human Papillomavirus) and a bacterial peptide (Pseudomonas
aeruginosa). The two DR2 restricted clones were each activated by four peptides. Both clones
recognized the MBP(85-99) peptide as well as peptides from EBV and Influenza virus. In
addition, one clone recognized a viral peptide from Reovirus (clone Hy.2E11) and one
recognized a peptide from Herpes simplex virus (clone Hy.1G11). These results, and the
sequences of these peptides, are summarized below in Table 2.

TABLE 2

Peptides Recognized by Clone Hy.1B11 (DQ1 Restricted):

MBP(85-99)                                     ENPVVHFFKNIVTPR
Herpes Simplex, UL15 Protein                   FRQLVHFVRDFAQLL
Adenovirus Type 12, ORF                        DFEVVTFLKDVLPEF
Pseudomonas aeruginosa, Phosphomannomutase     DRLLMLFAKDVVSRN
Human Papillomavirus Type 7, L2 Protein        IGGRVHFFKDISPIA

Peptides Recognized by Clone Hy.2E11 (DR2 Restricted):

MBP(85-99)                                     ENPVVHFFKNIVTPR
Epstein-Barr Virus, DNA Polymerase             TGGVYHFVKKHVHES
Influenza Type A, Hemagglutinin                YRNLVWFIKKNTRYP
Reovirus Type 3, Sigma 2 Protein               MARAAFLFKTVGFGG

Peptides Recognized by Clone Hy.1G11 (DR2 Restricted):

MBP(85-99)                                     ENPVVHFFKNIVTPR
Epstein-Barr Virus, DNA Polymerase             TGGVYHFVKKHVHES
Influenza Type A, Hemagglutinin                YRNLVWFIKKNTRYP
Herpes Simplex, DNA Polymerase                 GGRRLFFVKAHVRES

For these viral/bacterial peptides to be involved in the initiation of autoimmunity, they
have to be capable of potent T cell stimulation that results in marked clonal expansion of
autoaggressive T cell clones. The stimulatory capacity of each of these peptides was therefore
compared to the MBP(85-99) peptide in a titration experiment. The peptides were found to be
efficient stimulators of the MBP specific T cell clones; in particular the EBV peptide (DR2
restricted clones) and the adenovirus peptide (DQ1 restricted clone) were similar to the
MBP(85-99) peptide in their stimulatory capacity. These results demonstrate that the T cell
activation is not the result of a minor degree of 'cross-reactivity' but rather the result of structural
similarity sufficient for potent T cell activation.

Comparison of the peptide sequences that were found to stimulate the same TCR revealed
several interesting points:

(1) Only one peptide (Human Papillomavirus L2 protein) had striking sequence similarity
with the MBP(85-99) peptide in that all amino acids in the MBP(89-95) segment except position
94 (Asn to Asp) were identical (Table 2). For all other sequences, simple alignment would not
have predicted them to be efficient stimulators of MBP(85-99) specific T cell clones. Therefore,
absent the method of the present invention, these peptides would not have been identified.

(2) At positions not specified by the search criteria, the selection for particular amino acids
was still apparent (Table 2). For the DQ1 restricted clone, for example, aspartic acid was
selected at position P6 (residue 94 of MBP), a probable TCR contact residue, in all four peptides.
This position is occupied by asparagine in the MBP peptide (similar size, but no negative
charge). Substitution of Asn 94 by Asp in the MBP peptide markedly increased its stimulatory
capacity for the DQ1 restricted clone but reduced it for the DR2 restricted clone. Selection also
occurred at the neighboring P7 MHC contact (Ile 95) for which Ile, Val or Phe were selected (all
hydrophobic).

(3) Different selection events occurred for the DQ1 and the DR2 restricted clones: at
position P6 (94 of MBP), there was selection of aspartic acid (negative charge) for DQ1 peptides
whereas there was selection for lysine (positive charge) in two of the three peptides presented by
DR2.

(4) In the flanking segments (residues 85-87 and 97-99), no apparent selection took place
as amino acids with different size and charge were allowed.
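
The position-by-position comparison described in points (1) to (4) can be illustrated by tallying the residues found at a given MBP-equivalent position across the mimicry peptides of Table 2. The sketch below assumes that each 15-mer of Table 2 is aligned with MBP(85-99), so that its first residue corresponds to MBP residue 85; the names `residues_at`, `DQ1_MIMICS` and `DR2_MIMICS` are introduced here for illustration only.

```python
from collections import Counter

MBP_START = 85  # assumed MBP residue number of the first residue of each 15-mer

# Viral/bacterial peptides of Table 2 (the MBP peptide itself is omitted).
DQ1_MIMICS = {
    "Herpes simplex UL15": "FRQLVHFVRDFAQLL",
    "Adenovirus type 12 ORF": "DFEVVTFLKDVLPEF",
    "P. aeruginosa phosphomannomutase": "DRLLMLFAKDVVSRN",
    "HPV type 7 L2": "IGGRVHFFKDISPIA",
}
DR2_MIMICS = {
    "EBV DNA polymerase": "TGGVYHFVKKHVHES",
    "Influenza A hemagglutinin": "YRNLVWFIKKNTRYP",
    "Reovirus type 3 sigma 2": "MARAAFLFKTVGFGG",
    "Herpes simplex DNA polymerase": "GGRRLFFVKAHVRES",
}


def residues_at(peptides, mbp_residue):
    """Count the residues occupying the column aligned with an MBP residue."""
    index = mbp_residue - MBP_START
    return Counter(seq[index] for seq in peptides.values())


if __name__ == "__main__":
    print("DQ1, P6 (94):", dict(residues_at(DQ1_MIMICS, 94)))
    print("DQ1, P7 (95):", dict(residues_at(DQ1_MIMICS, 95)))
    print("DR2, P6 (94):", dict(residues_at(DR2_MIMICS, 94)))
```

Under the stated alignment assumption, the tally shows Asp at P6 in all four DQ1 peptides, only Ile, Val or Phe at P7, and Lys at P6 in the EBV and influenza peptides presented by DR2.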

The majority of the viruses from which these peptide sequences were identified are
common human pathogens: Influenza Type A frequently causes respiratory tract infections;
Human Papillomavirus infects epithelial tissues and has been linked to cervical carcinomas; and
Epstein-Barr Virus (EBV) causes an acute viral syndrome (infectious mononucleosis) in young
adults. Human Herpesvirus I (Herpes simplex), EBV and Human Papillomavirus cause latent or
persistent infections with neurons (Herpes simplex), B cells (EBV) and epithelial cells
(Papillomavirus) serving as the reservoirs. Viral expression can be reactivated by UV exposure
and stress (Herpes simplex) and by B cell activation (EBV) (Schwarz et al., 1985; Epstein et al.,
1977; Spruance, 1985; Tovey et al., 1978). For the induction and maintenance of an autoimmune
response these persistent viral infections are of particular interest as they could explain the
chronicity of the clinical disease and the clonal expansion and persistence of MBP specific T
cells. Reactivation of viral expression may also be involved in triggering clinical relapses. By
this mechanism, viral peptides could activate resting MBP specific T cells in the periphery and allow
them to invade the CNS.

Are these foreign epitopes actually presented to autoreactive T cells during a viral
infection? The peptide from the EBV DNA polymerase allowed this question to be addressed.
In EBV transformed B cells (which were used as antigen presenting cells in the T cell assays) the
lytic viral cycle is repressed. The DNA polymerase gene is not transcribed
in this latent state; however, B cell activation results in activation of the lytic cycle and in the
expression of the DNA polymerase gene (Datta et al., 1980). To examine MHC class II
restricted presentation of the EBV DNA polymerase, an HLA-DR2+ EBV transformed B cell line
(MGAR) and an MHC mismatched control (9001, HLA-DR1) were pretreated for 36 hours with
phorbol ester which was removed by extensive washing prior to coculture of APC with T cells.
T cell clones Hy.2E11 and Hy.1G11, which recognize the EBV DNA polymerase peptide
presented by HLA-DR2, were activated by an HLA-DR2+ EBV transformed B cell line pretreated
with phorbol ester. This effect was specific because MHC mismatched B cells did not activate
the clones; also, a control clone (Ob.1A12) that recognized MBP(85-99) but not the EBV peptide
was not activated. In a separate experiment, T cell activation was blocked by a mAb specific for
HLA-DR (mAb L243) but not by a mAb specific for HLA-DQ (G2a.5). These results
demonstrate that the MBP specific T cell clones recognize not only the viral peptide but also
antigen presenting cells infected with the virus. In vivo this recognition event could lead to
chronic antigenic stimulation of MBP specific T cells as B cell activation results in the
expression of EBV genes, including the DNA polymerase gene.

Finally, the presentation of the viral peptides by different DR2 subtypes was compared to
determine if they are efficiently presented by the disease associated molecule (DRB1*1501, the
most common DR2 subtype). The MBP peptide was presented by three of the four DR2
subtypes; the peptide was not presented by DRB1*1601 which differs from DRB1*1602 by a
single amino acid substitution (at position DRβ67, a possible TCR contact). The two viral
peptides were presented much better by the DR15 molecules (DRB1*1501 and 1502 which differ
only at position DRβ86) than by DRB1*1602. This was particularly evident for the influenza
peptide which only activated the T cell clone when presented by the DRB1*1501/1502
molecules but not by DRB1*1602. These results indicate that the viral peptides identified herein
are preferentially presented by the MS associated DR2 molecule (DRB1*1501).

DEFINITIONS 

For clarity of interpretation and to clearly and distinctly point out the subject matter of the
claimed invention, the following definitions are provided for several terms used in the claims
appended hereto.

The term "sequence motif," in accordance with the description provided herein, means a series
of restrictions on the residues which may occupy certain relative positions of an amino acid
sequence. A sequence motif must restrict at least three and preferably four or five positions of an
amino acid sequence. The relative positions of the first (N-terminal) and last (C-terminal)
restricted amino acid positions shall be separated by at least two but no more than twelve amino
acid residues. For example, P1 and P4 may be the first and last restricted residues and these
residues are separated by two residues. As another example, P-1 and P11 may be the first and
last restricted residues and these are separated by ten residues. Positions between the first and
last restricted positions may be restricted or unrestricted with the exception that a total of at least
three positions of the motif must be restricted. Of the three positions which must be restricted, at
least two must be residues corresponding to major MHC binding pockets. If only two of the
restricted residues correspond to MHC binding residues, the third must correspond to a TCR
contact residue. Further, at least one of the positions restricted must correspond to either the P1
or P4 binding position. By "restricted" is meant that at least one, and preferably ten, amino acid
residues shall be excluded from a position.
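
For illustration, the conditions of this definition can be expressed as a simple check. The sketch below is one possible reading of the definition and not an authoritative construction of the claims; the `Restriction` record, the `role` labels "MHC" and "TCR", and the function names are assumptions of this example. Positions are numbered as in the text, with no position zero between P-1 and P1.

```python
from dataclasses import dataclass

AMINO_ACIDS = frozenset("ACDEFGHIKLMNPQRSTVWY")


@dataclass(frozen=True)
class Restriction:
    position: int       # -1 for P-1, 1 for P1, 4 for P4, ... (0 is not used)
    allowed: frozenset  # residues permitted at this position
    role: str           # "MHC" (binding pocket) or "TCR" (contact residue)


def ordinal(position):
    """Map P-1, P1, P2, ... onto consecutive integers (there is no P0)."""
    return position + 1 if position < 0 else position


def is_valid_motif(restrictions):
    # A position counts as restricted only if at least one residue is excluded.
    restricted = [r for r in restrictions if AMINO_ACIDS - r.allowed]
    if len(restricted) < 3:
        return False
    ords = [ordinal(r.position) for r in restricted]
    separation = max(ords) - min(ords) - 1  # residues between first and last
    if not 2 <= separation <= 12:
        return False
    mhc = sum(1 for r in restricted if r.role == "MHC")
    tcr = sum(1 for r in restricted if r.role == "TCR")
    if mhc < 2 or (mhc == 2 and tcr < 1):
        return False
    # At least one restricted position must be P1 or P4.
    return any(r.position in (1, 4) for r in restricted)


if __name__ == "__main__":
    pv_motif_1 = [
        Restriction(1, frozenset("VLIMF"), "MHC"),
        Restriction(4, frozenset("KR"), "MHC"),
        Restriction(6, frozenset("STNV"), "MHC"),
    ]
    print(is_valid_motif(pv_motif_1))
```

The `ordinal` helper reproduces the two worked examples of the definition: P1 and P4 are separated by two residues, and P-1 and P11 by ten.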

An amino acid sequence "corresponds" to a sequence motif if it can be aligned with the
positions of the sequence motif such that, at each restricted position of the motif, the amino acid
sequence includes a residue which is not excluded from that position by the restrictions which
define the motif. As the restrictions which define the motif are derived from the size and nature
of the MHC class II binding pockets and, optionally, the TCR contact residues of known
epitopes, the restricted positions of the binding motif may also be said to correspond to the MHC
binding pockets and TCR contact residues.

The term "isolated," in reference to a protein or polypeptide, means separated from its
native or natural chemical microenvironment. Thus, a polypeptide isolated from a bacterium
shall be in a preparation substantially free of most of the other bacterial polypeptides and,
similarly, an isolated viral polypeptide preparation shall be substantially free of the other
polypeptides which comprise the virus.

The term "associated with," as used in connection with a particular HLA-DR protein and
an autoimmune disease or autoimmune response, shall mean that the protein and the
disease/response have been positively correlated by clinical or epidemiological studies such that
the likelihood of developing the disease/response has been shown to be increased by presence of
the protein.

The term "HLA-DR protein" means the particular protein product of a particular allele of
the MHC class II HLA-DR gene. A disease associated with an HLA-DR protein is one
associated with such a particular protein and not merely with the HLA-DR gene locus.

By the term "human pathogen" is meant a bacterium, virus or protozoon capable of
infecting humans and generating an immune response. The term specifically is intended to
exclude the bacteria which form part of the normal human intestinal flora. By this term, "normal
human intestinal flora," is meant the bacteria which normally inhabit the human gut, such as
Escherichia coli, but which do not normally cause disease.

By the term "autoreactive," as applied to T cells, is meant T cells from a human which are
activated by a human self epitope. By "activation" of T cells is meant induction to proliferate,
secrete lymphokines (cytokines) and/or to initiate effector activity (e.g., cytotoxicity).

By the term "autoantigen" is meant a self protein or polypeptide which includes a "self
epitope." By "self epitope" is meant that part of an autoantigen which is recognized by T cells
when bound to and presented by an MHC molecule.

By the term "effective amount," with respect to tolerizing an individual to an antigen, is
meant an amount of the antigen sufficient to render T cells, otherwise specific for the antigen,
unresponsive to the antigen when bound to and presented by an MHC molecule. T cells which
are unresponsive fail to activate when presented with the antigen for which they are specific. By
the term "effective amount," with respect to immunizing an individual to an antigen, is meant an
amount sufficient to induce an immune response which results in T cells specific for the antigen.
Typical ranges of dosages are from 1 nanogram/kilogram to 100 milligrams/kilogram or even
500 milligrams/kilogram. Effective amounts will vary according to such factors as age, sex and
sensitivity to the antigen.

By the term "core MHC binding residues" is meant the residues of an epitope
corresponding to the P-1 to P9 positions of a peptide bound to a MHC class II molecule, such as
an HLA-DR molecule. By the term "inner core MHC binding residues" is meant those residues
of an epitope corresponding to the P-1 to P6 positions of a peptide bound to a MHC class II
molecule, such as an HLA-DR molecule.

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 


(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/400,796 

(B) FILING DATE: 07-MAR-1995 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: GATES, EDWARD R. 

(B) REGISTRATION NUMBER: 31,616 

(C) REFERENCE/DOCKET NUMBER: H04 98/7015WO 

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: 617-720-3500 

(B) TELEFAX: 617-720-2441 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE:

(A) ORGANISM: HOMO SAPIENS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Ala Thr Gln Lys Ile Thr Tyr Arg Ile Ser Gly Val Gly Ile Asp
1               5                   10                  15



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE:

(A) ORGANISM: HOMO SAPIENS

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Phe Gly Ile Phe Val Val Asp Lys Asn Thr Gly Asp Ile Asn Ile
1               5                   10                  15



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Leu Asn Ser Lys Ile Ala Phe Lys Ile Val Ser Gln Glu Pro Ala
1               5                   10                  15

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE:

(A) ORGANISM: HOMO SAPIENS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Thr Pro Met Phe Leu Leu Ser Arg Asn Thr Gly Glu Val Arg Thr
1               5                   10                  15



(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 



Cys Glu Cys Asn Ile Lys Val Lys Asp Val Asn Asp Asn Phe Pro
1               5                   10                  15



(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Ser Ala Arg Thr Leu Asn Asn Arg Tyr Thr Gly Pro Tyr Thr Phe
1               5                   10                  15



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE:

(A) ORGANISM: HOMO SAPIENS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Gln Ser Gly Thr Met Arg Thr Arg His Ser Thr Gly Gly Thr Asn
1               5                   10                  15



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HERPES SIMPLEX 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Phe Arg Gln Leu Val His Phe Val Arg Asp Phe Ala Gln Leu Leu
1               5                   10                  15

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: ADENOVIRUS TYPE 12 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Asp Phe Glu Val Val Thr Phe Leu Lys Asp Val Leu Pro Glu Phe
1               5                   10                  15



(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE:

(A) ORGANISM: PSEUDOMONAS AERUGINOSA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Asp Arg Leu Leu Met Leu Phe Ala Lys Asp Val Val Ser Arg Asn 
1               5                   10                  15



(2) INFORMATION FOR SEQ ID NO: 11:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HUMAN PAPILLOMAVIRUS TYPE 7 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Ile Gly Gly Arg Val His Phe Phe Lys Asp Ile Ser Pro Ile Ala
1               5                   10                  15

(2) INFORMATION FOR SEQ ID NO: 12:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: EPSTEIN-BARR VIRUS

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

Thr Gly Gly Val Tyr His Phe Val Lys Lys His Val His Glu Ser 
1               5                   10                  15



(2) INFORMATION FOR SEQ ID NO:13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE:

(A) ORGANISM: INFLUENZA VIRUS TYPE A 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Tyr Arg Asn Leu Val Trp Phe Ile Lys Lys Asn Thr Arg Tyr Pro
1               5                   10                  15

(2) INFORMATION FOR SEQ ID NO: 14:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: REOVIRUS TYPE 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Met Ala Arg Ala Ala Phe Leu Phe Lys Thr Val Gly Phe Gly Gly
1               5                   10                  15

(2) INFORMATION FOR SEQ ID NO: 15:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HERPES SIMPLEX VIRUS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Gly Gly Arg Arg Leu Phe Phe Val Lys Ala His Val Arg Glu Ser 
1               5                   10                  15



(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE:

(A) ORGANISM: HOMO SAPIENS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

Glu Asn Pro Val Val His Phe Phe Lys Asn Ile Val Thr Pro Arg
1               5                   10                  15

CLAIMS

We claim: 



1. An isolated polypeptide consisting essentially of an amino acid sequence selected from the
group consisting of SEQ ID NO.: 1, SEQ ID NO.: 2, SEQ ID NO.: 3, SEQ ID NO.: 4, SEQ ID
NO.: 5, SEQ ID NO.: 6, and SEQ ID NO.: 7.

2. An isolated polypeptide consisting essentially of an amino acid sequence selected from the
group consisting of SEQ ID NO.: 8, SEQ ID NO.: 9, SEQ ID NO.: 10, SEQ ID NO.: 11, SEQ ID
NO.: 12, SEQ ID NO.: 13, SEQ ID NO.: 14, and SEQ ID NO.: 15.

3. A pharmaceutical preparation for tolerization comprising a pharmaceutically acceptable
carrier and

an amount of an isolated human polypeptide effective for tolerizing an individual to an
autoantigen, said human polypeptide including an amino acid sequence corresponding to a
sequence motif for a MHC class II protein, such as HLA-DR;

wherein said protein is associated with a human autoimmune disease;
wherein said polypeptide binds to said protein;

wherein said polypeptide bound to said protein activates autoreactive T cells from a
subject having said autoimmune disease; and

wherein said polypeptide is a non-collagen and non-myelin basic protein polypeptide.

4. The pharmaceutical preparation of claim 3 wherein said protein is an HLA-DR4 or
HLA-DQ1 protein and said autoimmune disease is pemphigus vulgaris.

5. The pharmaceutical preparation of claim 4 wherein said motif is PV motif #1.

6. The pharmaceutical preparation of claim 4 wherein said amino acid sequence consists
essentially of an amino acid sequence selected from the group consisting of SEQ ID NO.: 1, SEQ
ID NO.: 2, SEQ ID NO.: 3, SEQ ID NO.: 4, SEQ ID NO.: 5, SEQ ID NO.: 6, and SEQ ID NO.: 7.

7. A pharmaceutical preparation for tolerization comprising a pharmaceutically acceptable
carrier and

an amount of an isolated human pathogen polypeptide effective for tolerizing an individual
to said polypeptide, said polypeptide including an amino acid sequence corresponding to a
sequence motif for a MHC class II protein, such as HLA-DR;

wherein said protein is associated with a human autoimmune disease;
wherein said polypeptide binds to said protein; and

wherein said polypeptide bound to said protein activates autoreactive T cells from a
subject having said autoimmune disease.

8. The pharmaceutical preparation of claim 7 wherein said HLA-DR protein is an HLA-DR2 
protein and said autoimmune disease is multiple sclerosis. 

9. The pharmaceutical preparation of claim 8 wherein said motif is selected from the group
consisting of MS motif #1, MS motif #2 and MS motif #3.

10. The pharmaceutical preparation of claim 8 wherein said amino acid sequence consists
essentially of an amino acid sequence selected from the group consisting of SEQ ID NO.: 8, SEQ
ID NO.: 9, SEQ ID NO.: 10, SEQ ID NO.: 11, SEQ ID NO.: 12, SEQ ID NO.: 13, SEQ ID NO.:
14, and SEQ ID NO.: 15.

11. A method of tolerizing an individual to an autoantigen of pemphigus vulgaris comprising
administering an effective amount of the pharmaceutical preparation of any one of claims
4-6 to a subject in need of such treatment.

12. A method of tolerizing an individual to a foreign antigen implicated in multiple sclerosis 
comprising 

administering an effective amount of the pharmaceutical preparation of any one of claims 
8-10 to an individual in need of such treatment. 

13. A pharmaceutical preparation for vaccinating an individual at risk of an autoimmune
disease comprising a pharmaceutically acceptable carrier and

an amount of an immunogenic preparation effective to immunize against a human
pathogen that in its native form includes a polypeptide having an amino acid sequence
corresponding to a sequence motif for a MHC class II protein, such as a HLA-DR;

wherein said protein is associated with said autoimmune disease;
wherein said polypeptide binds to said protein;

wherein said polypeptide bound to said protein activates autoreactive T cells from a
subject having said autoimmune disease; and

wherein said preparation is free of a polypeptide corresponding to said sequence.

14. The pharmaceutical preparation of claim 13 wherein said protein is an HLA-DR4 or
HLA-DQ1 protein and said autoimmune disease is pemphigus vulgaris.

15. The pharmaceutical preparation of claim 14 wherein said motif is PV motif #1.

16. The pharmaceutical preparation of claim 14 wherein said amino acid sequence consists
essentially of an amino acid sequence selected from the group consisting of SEQ ID NO.: 1, SEQ
ID NO.: 2, SEQ ID NO.: 3, SEQ ID NO.: 4, SEQ ID NO.: 5, SEQ ID NO.: 6, and SEQ ID NO.: 7.

17. The pharmaceutical preparation of claim 13 wherein said protein is an HLA-DR2 protein
and said autoimmune disease is multiple sclerosis.

18. The pharmaceutical preparation of claim 17 wherein said motif is selected from the group
consisting of MS motif #1, MS motif #2 and MS motif #3.

19. The pharmaceutical preparation of claim 17 wherein said amino acid sequence consists
essentially of an amino acid sequence selected from the group consisting of SEQ ID NO.: 8, SEQ
ID NO.: 9, SEQ ID NO.: 10, SEQ ID NO.: 11, SEQ ID NO.: 12, SEQ ID NO.: 13, SEQ ID NO.:
14, and SEQ ID NO.: 15.

20. A method of vaccinating an individual at risk of pemphigus vulgaris comprising
administering an effective amount of the pharmaceutical preparation of any one of
claims 14-16.

21. A method of vaccinating an individual at risk of multiple sclerosis comprising
administering an effective amount of the pharmaceutical preparation of any one of
claims 17-19.

22. The pharmaceutical preparation of claim 17 wherein said pathogen and said peptide
are selected from the group consisting of the respective pairs Herpes simplex virus and UL15
protein, Herpes simplex virus and SEQ ID NO.: 8, Adenovirus and Adenovirus ORF protein,
Adenovirus and SEQ ID NO.: 9, Pseudomonas aeruginosa and phosphomannomutase
protein, Pseudomonas aeruginosa and SEQ ID NO.: 10, Papillomavirus and L2 protein,
Papillomavirus and SEQ ID NO.: 11, Epstein-Barr virus and DNA polymerase protein,
Epstein-Barr virus and SEQ ID NO.: 12, Influenza virus and hemagglutinin protein, Influenza
virus and SEQ ID NO.: 13, Reovirus and sigma 2 protein, Reovirus and SEQ ID NO.: 14,
Herpes simplex virus and DNA polymerase, and Herpes simplex virus and SEQ ID NO.: 15.

23. A method of evaluating a peptide for an ability to induce an autoimmune response
comprising the steps of

(1) choosing an MHC class II molecule, such as HLA-DR, associated with said
autoimmune response, said MHC molecule having major MHC binding pockets designated
P1, P4, P6, P7 and P9, each said pocket binding an amino acid residue at a corresponding
relative position, Px, of an epitope;

(2) selecting a first major MHC binding pocket Pi and a second major MHC binding
pocket Pj;

(3) identifying a first set of amino acid residues which bind within said first pocket,
identifying a second set of amino acid residues which bind within said second pocket; and

(4) comparing an amino acid sequence of said peptide to a sequence motif wherein
said motif includes said first set of amino acid residues at a relative position Pi and said
second set of amino acid residues at a relative position Pj.

24. The method of claim 23

wherein said epitope is an epitope of known amino acid sequence and wherein said
epitope has TCR contact residues designated P-1, P2, P3, P5, P8 and P11;

wherein step (2) further comprises selecting a first TCR contact point at a relative
position Pk of said epitope and wherein step (3) further comprises identifying a third set of
amino acid residues which bind a TCR at said contact point, and

wherein said motif includes said third set of amino acid residues at a relative
position Pk.

25. The method of any one of claims 23 and 24 wherein said first binding pocket is a P1
pocket and said second binding pocket is selected from the group consisting of a P4 pocket
and a P6 pocket.
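
By way of illustration only, and without limiting the foregoing claims, the comparison of a peptide sequence to a sequence motif may be sketched in Python as follows. The function names, the one-letter amino acid coding, and the residue sets in the example are hypothetical and are not taken from the specification; in particular, the example motif is a placeholder and is not PV motif #1 or any MS motif. The sketch assumes that relative positions have no P0, so that P-1 immediately precedes P1.

```python
from typing import Dict, FrozenSet, List

# A motif maps a relative position Px to the set of residues allowed there,
# using one-letter amino acid codes. The residue sets are supplied by the
# caller; nothing here encodes the motifs of the specification.
Motif = Dict[int, FrozenSet[str]]


def offset_from_p1(px: int) -> int:
    """Distance of position Px from P1 (assumption: no P0, P-1 precedes P1)."""
    if px == 0:
        raise ValueError("P0 is not a valid relative position")
    return px - 1 if px > 0 else px


def find_motif(peptide: str, motif: Motif) -> List[int]:
    """Return each 0-based index at which P1 can sit so that all positions match."""
    hits: List[int] = []
    if not motif:
        return hits
    offsets = {offset_from_p1(px): allowed for px, allowed in motif.items()}
    for p1 in range(len(peptide)):
        ok = True
        for off, allowed in offsets.items():
            i = p1 + off
            # A position falling outside the peptide cannot be satisfied.
            if i < 0 or i >= len(peptide) or peptide[i] not in allowed:
                ok = False
                break
        if ok:
            hits.append(p1)
    return hits


if __name__ == "__main__":
    # SEQ ID NO: 16 transcribed to one-letter code.
    peptide = "ENPVVHFFKNIVTPR"
    # Hypothetical placeholder motif: one residue set at P1 and one at P4.
    toy_motif = {1: frozenset("V"), 4: frozenset("F")}
    print(find_motif(peptide, toy_motif))
```

A third residue set at a TCR contact position Pk, as in claim 24, is handled by adding a further entry to the same dictionary, for example a key of 5 for P5 or -1 for P-1.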

26. A method of identifying foreign antigens implicated in human autoimmune response
comprising the steps of

(1) choosing an MHC class II molecule, such as HLA-DR, associated with said
autoimmune response, said MHC molecule having major MHC binding pockets designated
P1, P4, P6, P7 and P9, each said pocket binding an amino acid residue at a relative position
Px of an epitope;

(2) selecting a first major MHC binding pocket Pi and a second major MHC binding
pocket Pj;

(3) identifying a first set of amino acid residues which bind within said first pocket,
identifying a second set of amino acid residues which bind within said second pocket;

(4) defining a sequence motif wherein said motif includes said first set of amino acid
residues at a relative position Pi and said second set of amino acid residues at a relative
position Pj;

(5) identifying a set of human pathogen peptide sequences corresponding to said
motif.

27. The method of claim 26 further comprising the step of excluding from said set 
sequences from at least one species in a normal human intestinal flora. 

28. The method of claim 26 further comprising the step of excluding from said set
sequences from at least one species of pathogen negatively correlated with the incidence of
said response.

29. The method of claim 26 wherein step (5) includes a search of a computer database
using said motif as a search criterion.
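
By way of illustration only, and without limiting claims 26 to 29, a motif-based search with species exclusion may be sketched in Python as follows. The in-memory list stands in for a sequence database; the record layout, function names, species labels and sequences are hypothetical placeholders and do not describe any actual database or pathogen. The sketch assumes one-letter amino acid codes and handles only positions P1 and above.

```python
import re
from typing import AbstractSet, Dict, Iterable, List, NamedTuple, Tuple


class Record(NamedTuple):
    species: str
    protein: str
    sequence: str  # one-letter amino acid codes


def motif_to_regex(motif: Dict[int, str]) -> re.Pattern:
    """Compile {relative position >= 1: allowed residues} into a regex."""
    if not motif or min(motif) < 1:
        raise ValueError("this sketch only handles positions P1 and above")
    parts = []
    for px in range(1, max(motif) + 1):
        if px in motif:
            if not motif[px]:
                raise ValueError("empty residue set at P%d" % px)
            parts.append("[" + re.escape(motif[px]) + "]")
        else:
            parts.append(".")  # unconstrained position
    # The lookahead lets overlapping occurrences be reported.
    return re.compile("(?=(" + "".join(parts) + "))")


def search(
    records: Iterable[Record],
    motif: Dict[int, str],
    excluded_species: AbstractSet[str] = frozenset(),
) -> List[Tuple[str, str, int, str]]:
    """Return (species, protein, 0-based start, matched segment) for each hit."""
    pattern = motif_to_regex(motif)
    hits = []
    for rec in records:
        # Exclusion step, e.g. intestinal flora or negatively correlated species.
        if rec.species in excluded_species:
            continue
        for m in pattern.finditer(rec.sequence):
            hits.append((rec.species, rec.protein, m.start(), m.group(1)))
    return hits


if __name__ == "__main__":
    # Hypothetical toy records; not real organisms or proteins.
    db = [
        Record("species_a", "protein_1", "AAVHFFKAA"),
        Record("species_b", "protein_2", "GGVAAFGG"),
        Record("species_c", "protein_3", "MMMMMM"),
    ]
    toy_motif = {1: "V", 4: "F"}  # placeholder residue sets
    for hit in search(db, toy_motif, excluded_species={"species_b"}):
        print(hit)
```

The exclusion set is passed separately from the motif so that the same motif can be searched with and without the exclusions of claims 27 and 28.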